The Economics of Non-Convex Ecosystems:
Introduction
Cambridge University, Faculty of Economics, Sidgwick Avenue, Cambridge CB3 9DD, UK
(E-mail: partha.dasgupta@econ.cam.ac.uk); Beijer International Institute of Ecological
Abstract. The word “convexity” is ubiquitous in economics, but absent from ecology. In this
paper we explain why, and show what difference it makes to economic analysis if ecosystem 
non-convexities are taken seriously. A simple proof is provided of the connection between “self- 
similarity” and “power laws”. We also provide an introduction to each of the papers in the 
Symposium and draw out the way in which they form a linked set of contributions. 

Key words: bifurcation points, environmental Kuznets curve, hysteresis, irreversibility, non- 
convexity, Pontryagin Principle, power laws, property rights, separatrix, structural stability, 
thresholds 



1. Economists’ Convexities and Nature’s Non-Convexities 

The word “convexity” is ubiquitous in economics, but absent from ecology. There
is a reason for each. As prices are prominent in modern transactions, it is but natural
that we would wish to uncover the ways in which the price system is capable of
functioning as a resource allocation mechanism. In recent years economists have
identified not only the way in which prices aggregate dispersed pieces of information,
but also the sense in which they reflect the relative scarcities of goods and
services. We now know that the price system can be an efficient allocation mechanism
if transformation possibilities among goods and services - in and over time -
constitute a convex set. However, except in the case of partial economic systems,
as in models of industrial organization (Tirole 1993), or of systems harbouring
very specific forms of non-convexities, as in modern growth models (Jones 1998)
or in models of poverty traps based on the connection between nutritional status
and human productivity (Dasgupta and Ray 1986, 1987; Dasgupta 1997) or in
models of spatial economies (Fujita, Krugman and Venables 1999), we still do
not have a clear understanding of the mechanisms by which resources are allocated
in non-convex environments. So we economists continue to rely on the convexity
assumption, always hoping that it is not an embarrassing simplification.

Ecologists have no comparable need to explore the structure of convex
sets. They are interested in identifying pathways by which the constituents of
an ecosystem interact with one another and with the external environment. A
large body of empirical work has revealed that those pathways in many cases
involve transformation possibilities among environmental goods and services that,
together, constitute non-convex sets. Often the non-convexities reflect positive
feedbacks in Human-Nature interactions. Mathematical ecologists therefore study
the structural stability of ecosystems and the sizes and shapes of their basins
of attraction for given sets of environmental parameters. Such notions as the
“resilience” of ecosystems are expressions of this research interest.

The price mechanism is especially problematic in economic systems characterised
by positive feedback processes. We now know that in such environments
it may prove impossible to decentralise an efficient allocation of resources by
means exclusively of prices. Efficient mechanisms would typically involve additional
social contrivances, such as (Pigouvian) taxes and subsidies, quantity
controls, social norms of behaviour, and so forth. This was proved formally in
a justly famous article by Starrett (1972), who showed that for certain types
of non-convexities associated with environmental pollution, a competitive price
equilibrium simply does not exist: markets for pollution would be unable to equate
demands to supplies. Starrett’s non-convexities are present when losses traceable to
environmental pollution are bounded. If the market price for pollution in such situations
were negative (i.e., the polluter has to pay the pollutee), pollutees’ demand
would be unbounded, while supply would be bounded. On the other hand, if the
price were non-negative, demand would be zero, while supply, presumably, would
be positive.

Starrett’s finding implied that private property rights to environmental pollution
would not be capable of sustaining an efficient allocation of resources by means
of the price system. In a subsequent note, Starrett (1973) demonstrated by means
of an example that if property rights are awarded to polluters, even such a non-price
resource allocation mechanism as the core may not exist. But he showed
(Starrett 1972) that a suitably chosen set of pollution taxes, together with a system
of competitive markets for other goods and services (assuming that the latter constitute
a convex sector), would be capable of supporting an efficient allocation of
resources. As there are no markets for pollution in such an allocation mechanism,
the problem of equating supply to demand in pollution activities is bypassed. The
moral would appear to be that social difficulties arising from the non-convexities
can be overcome if the State were to assign property rights in a suitable way -
permitting private rights to the convex sector, but reserving for itself the right to
control emissions and discharges.



2. Ecological Thresholds and the Environmental Kuznets Curve 

Despite the ecologist’s strictures, we economists have remained ambivalent toward
Nature’s non-convexities. Often enough, that ambivalence reveals itself only indirectly.
For example, it is even today commonly thought that, to quote an editorial
in the UK’s The Independent (4 December 1999), “. . . (economic) growth is good
for the environment because countries need to put poverty behind them in order to
care”; or, to quote The Economist (4 December 1999, p. 17), “. . . trade improves
the environment, because it raises incomes, and the richer people are, the more
willing they are to devote resources to cleaning up their living space”.

The view’s origin can be traced to World Bank (1992), which observed an
empirical relationship between GNP per head and atmospheric concentrations of
industrial pollutants. Based on the historical experience of OECD countries, the
authors of the document suggested that, when GNP per head is low, concentrations
of such pollutants as the sulphur oxides increase as GNP per head increases,
but that when GNP per head is high, concentrations decrease as GNP per head
increases further. Among economists this relationship has been christened the
“environmental Kuznets curve”. The moral that would appear to have been drawn
from the finding is that resource degradation is reversible: degrade all you want
now, Earth can be relied upon to rejuvenate it later if you require it.

The presumption is false. Nature’s non-convexities are frequently a manifestation
of positive feedback processes, which in turn often means the presence of
ecological thresholds. But if a large damage were to be inflicted on an ecosystem
whose ability to function is conditional on it being above some threshold level
(in size, composition, or whatever), the consequence would be irreversible. The
environmental Kuznets curve was detected for mobile pollutants. Mobility means
that, so long as emissions decline, the stock at the site of the emissions declines.
However, reversal is the last thing that would spring to mind should a grassland flip
to become covered by shrubs, or should the Atlantic gulf stream shift direction or
come to a halt, or should a source of water disappear, or should an ocean fishery
become a dead zone owing to overfishing. As a metaphor for the possibilities of
substituting manufactured and human capital for natural capital, the relationship
embodied in the environmental Kuznets curve has to be rejected.

Although non-convexities are prevalent in global ecosystems (e.g., ocean circulation,
global climate), it is as well to emphasise the spatial character of many
positive feedback processes. The latter have a direct bearing on the rural poor
in the world’s poorest regions. Eutrophication of ponds, or salinization of soil,
or biodiversity loss in a forest patch involve crossing ecological thresholds at
a spatially localised level. Similarly, the metabolic pathways between an individual’s
nutritional status and his or her capacity to work, and those between
a person’s nutritional and disease status involve positive feedback. Studies of
extreme poverty based on aggregation at the regional or national level can therefore
mislead greatly. The spatial confinement of many of the non-convexities inherent
in Human-Nature interactions needs always to be kept in mind.

The connection between rural poverty in the world’s poorest regions and the
state of the local ecosystems should be self-evident. When wetlands, inland and
coastal fisheries, woodlands, forests, ponds and lakes, and grazing fields are
damaged (owing, say, to agricultural encroachment, or urban extensions, or the
construction of large dams, or organizational failure at the village level), traditional
dwellers suffer. For them - and they are among the poorest in society - there
is frequently no alternative source of livelihood. In contrast, for rich eco-tourists
or importers of primary products, there is something else, often somewhere else,
which means that there are alternatives. Whether or not there are substitutes for a
particular resource is therefore not only a technological matter, nor a mere matter
of consumer taste: among poor people location can matter too. The poorest of the
poor experience non-convexities in a way the rich do not. Even the range between
a need and a luxury is context-ridden. Macroeconomic reasoning glosses over the
heterogeneity of Earth’s resources and the diverse uses to which they are put - by
people residing at the site and by those elsewhere.



3. Motivation Behind this Symposium 

Despite their prevalence, we economists continue to show little interest in Nature’s
non-convexities. For example, the now enormous literature on “green accounting”
has been built largely on the backs of economic models in which non-convexities
are safely out of sight. In view of this, the Beijer International Institute of Ecological
Economics in Stockholm invited a group of ecologists and economists to
study the economics of non-convex ecosystems. Beginning in 1998, the group met
regularly. We knew that much fundamental work had already been accomplished
on non-linear dynamical systems. Some of it had already been used to study the
economics of non-convex social environments. Our idea therefore was to use
these techniques so as to study prototypical Human-Nature interactions. Specific
applications were very much in our mind.

As our understanding of the issues improved, we felt it would be desirable too if 
the articles we prepared were written expansively. So we encouraged ourselves also 
to review what has already been achieved in the subject. We felt that the collection 
could then serve as a self-contained body of work, useable in graduate courses in 
environmental and resource economics, perhaps useful as well to researchers who 
wish to work in this field. In order to increase accessibility to the collection, we
felt it would be appropriate to publish it in a journal, rather than as a book. We are 
therefore most grateful to the members of the Editorial Board of Environmental and 
Resource Economics, especially Ian Bateman and Kerry Turner, for the enthusiasm 
they have shown toward our enterprise from the time we approached them, and for 
the encouragement they have given us to produce the collection in the form that it 
now appears. 
4. Macro Regeneration Functions of Ecosystems

4.1. PRODUCTION FUNCTIONS AND COMMODITY POSSIBILITY SETS 

In contrast to the production scale-economies studied in traditional price theory, the
non-convexities associated with ecological thresholds manifest themselves across
time. Nevertheless, there is a formal connection between the two types of non-convexities.

To see this, consider the economist’s old stand-by, a one-commodity, constant
population economy, where the commodity in question is durable and non-deteriorating.
Time is continuous and the economy is deterministic. Let $K_t$ ($\ge 0$)
and $C_t$ ($\ge 0$) be the “capital” stock and the flow of consumption, respectively, at
$t$ ($\ge 0$). Output is assumed to be given by the production function $F(K)$, where $F$
is differentiable everywhere, $F(0) = 0$ and $F(K) > 0$ for some $K > 0$. An extreme
assumption, much used in general equilibrium theory, is that output is freely disposable.
Subsequently we will see what the assumption involves and why its violation
is significant for the subject of this Symposium.

Let $I_t$ denote investment at $t$. Since output is freely disposable, we may express
the balance of flows at each date in the economy by the inequality,

$$I_t \le F(K_t) - C_t, \quad \text{for } t \ge 0, \text{ and } K_0\ (> 0) \text{ is given}, \tag{1a}$$

which, without rigorous justification, we write as,

$$\frac{dK_t}{dt} \le F(K_t) - C_t, \quad \text{for } t \ge 0, \text{ and } K_0\ (> 0) \text{ is given}. \tag{1b}$$

A programme is a complete forecast of the economy, from the present ($t = 0$)
to infinity. A programme can be expressed as $\{K_t, C_t, dK_t/dt\}_0^\infty$. For simplicity of
exposition, we suppose that any programme satisfying (1b) is feasible; which is to
say that the economy is not subject to any other constraint.

The question is whether the set of feasible programmes is convex.

The answer depends on whether $F$ is concave everywhere (henceforth, concave).
To confirm this, assume that $F$ is concave. Consider any two feasible programmes,
which we write as $\{K_t^*, C_t^*, dK_t^*/dt\}_0^\infty$ and $\{K_t', C_t', dK_t'/dt\}_0^\infty$. By definition,
both satisfy (1b). Hence,

$$\frac{dK_t^*}{dt} \le F(K_t^*) - C_t^*, \quad \text{for } t \ge 0, \text{ and } K_0^* = K_0, \tag{2}$$

$$\text{and} \quad \frac{dK_t'}{dt} \le F(K_t') - C_t', \quad \text{for } t \ge 0, \text{ and } K_0' = K_0. \tag{3}$$

Now choose a number $\gamma$, where $0 < \gamma < 1$. Define

$$\bar{C}_t = \gamma C_t^* + (1 - \gamma) C_t' \quad \text{and} \quad \bar{K}_t = \gamma K_t^* + (1 - \gamma) K_t'. \tag{4}$$

We wish to confirm that $\{\bar{K}_t, \bar{C}_t, d\bar{K}_t/dt\}_0^\infty$ also satisfies (1b).

From (2) and (3), we have,

$$\gamma \frac{dK_t^*}{dt} \le \gamma F(K_t^*) - \gamma C_t^*, \quad \text{for } t \ge 0, \text{ and } K_0^* = K_0, \tag{5}$$

$$\text{and} \quad (1 - \gamma) \frac{dK_t'}{dt} \le (1 - \gamma) F(K_t') - (1 - \gamma) C_t', \quad \text{for } t \ge 0, \text{ and } K_0' = K_0. \tag{6}$$

Since $F$ is concave,

$$F(\bar{K}_t) = F(\gamma K_t^* + (1 - \gamma) K_t') \ge \gamma F(K_t^*) + (1 - \gamma) F(K_t'). \tag{7}$$

From (2)-(7) we conclude that

$$\frac{d\bar{K}_t}{dt} \le F(\bar{K}_t) - \bar{C}_t, \quad \text{for } t \ge 0, \text{ and } \bar{K}_0 = K_0, \tag{8}$$

which means that the set of programmes satisfying (1b) is convex.

If $F(K)$ is not concave, the above argument does not work, because inequality
(7) cannot be guaranteed. In fact, if $F(K)$ is not concave, it is possible to find a
pair of programmes that satisfy inequality (1b) and a convex combination of the
two that does not satisfy inequality (1b). This means that if the function $F(K)$ is not
concave, the set of feasible programmes is not convex.
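The failure of convexity can be checked numerically at a single date. The sketch below is only an illustration under stated assumptions: it compares two hypothetical points at which the capital stock is momentarily constant and all output is consumed, so that the flow constraint holds with equality, and then tests whether their average still satisfies the constraint. The two functions `concave_F` and `convex_concave_F` and all numerical values are chosen for illustration and do not come from the paper.

```python
def satisfies_flow_constraint(F, K, C, dK_dt=0.0, tol=1e-12):
    """Pointwise version of the feasibility inequality: dK/dt <= F(K) - C."""
    return dK_dt <= F(K) - C + tol


def average_is_feasible(F, K_a, K_b, weight=0.5):
    """Mix two points with dK/dt = 0 and C = F(K), then test the mixture."""
    C_a, C_b = F(K_a), F(K_b)          # each point is feasible on its own
    K_mix = weight * K_a + (1 - weight) * K_b
    C_mix = weight * C_a + (1 - weight) * C_b
    return satisfies_flow_constraint(F, K_mix, C_mix)


def concave_F(K):
    # Illustrative concave production function (an assumption).
    return K ** 0.5


def convex_concave_F(K):
    # Illustrative convex-concave function (an assumption).
    return K ** 2 / (1 + K ** 2)


print(average_is_feasible(concave_F, 0.0, 1.0))         # concave case
print(average_is_feasible(convex_concave_F, 0.0, 1.0))  # non-concave case
```

With the concave function the averaged point remains feasible; with the convex-concave function the averaged consumption exceeds what the averaged stock can produce, so the inequality fails.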

4.2. WELFARE ECONOMICS IN A CONVEX WORLD 

Let us review welfare economics in a world where $F(K)$ is concave. Both consumption
and the capital stock are assumed to be “goods” (i.e., neither is a pollutant),
and we imagine that the flow of social welfare at $t$ is a strictly concave function,
$U(C_t, K_t)$. $U$ is assumed to be twice partially differentiable everywhere. Let $\delta$ ($> 0$)
be the utility rate of discount. Intertemporal welfare is taken to be of the form,

$$\int_0^\infty U(C_t, K_t)\, e^{-\delta t}\, dt, \quad \text{where } \delta > 0. \tag{9}$$

The standard problem in welfare economics is to locate that programme which
maximizes (9) subject to (1b). But since output is a good and there are no bad
byproducts, it would be silly, ever, to dispose of any output, which is another
way of saying that the shadow price of output is positive. So we may express the
optimization problem as being one of locating that programme which maximizes
(9) subject to:

$$\frac{dK_t}{dt} = F(K_t) - C_t, \quad \text{for } t \ge 0, \text{ and } K_0\ (> 0) \text{ given}. \tag{10}$$

As is now hugely familiar, the problem lends itself to analysis by the techniques
developed by Pontryagin. In view of the assumptions that have been made about
$F$ and $U$, we know that the Hamiltonian of the maximization problem is a concave
function of the state and control variables. This means that the optimum can be
implemented with the help of intertemporal accounting prices in a decentralized
economic environment, the point to which we drew attention in Section 1 of the
Introduction.

The literature on optimum economic growth has shown that if $U$ depends only
on $C$, the Pontryagin conditions yield a unique stationary point (a saddle point), the
target that ought to be aimed at no matter what is the value of $K_0$. To the best of
our knowledge, Kurz (1968) was the first to note that if $U$ depends not only on $C$,
but also on $K$ (as is the case of the $U$ postulated here), the Pontryagin optimality
conditions may possess multiple stationary points, meaning that in the space of $K$
and its co-state variable, there are multiple basins of attraction. Kurz also showed
that the long run features of the optimum programme depend on $K_0$. In other words,
history matters. We conclude that even when the Hamiltonian of an optimization
problem is concave in the state and the control variables, the Pontryagin optimality
conditions can have multiple stationary points.

In the economics of forestry it has been common to assume that the growth
function of a tree’s biomass is concave. Denoting the biomass of a tree by $K$, it is
assumed that $K$’s growth function is quadratic. If trees are not harvested, biomass
accumulates as,

$$\frac{dK_t}{dt} = F(K_t) = aK_t - bK_t^2, \quad a, b > 0, \text{ and } K_0 > 0. \tag{11}$$

Equation (11) possesses two stationary points: $K = 0$ and $K = a/b$. The former is
unstable, while the latter is stable. The system therefore possesses a single basin of
attraction. Starting with a seedling, a tree’s biomass grows as a logistic function.
This gives rise to the famous Faustmann Rule for the age at which a synchronized
forest should be felled for its timber and then reseeded immediately. Underlying
Faustmann’s Rule is the assumption that a forest is valuable only for its timber,
which means that tree biomass has no direct worth (say, as a habitat for biodiversity).
As is well known, the biomass felled according to the Rule, at regular
intervals, is less than $a/2b$ (the point at which $F(K)$ is maximum). Given that $F$ in
equation (11) is concave, the Faustmann Rule can in principle be implemented by
a price system.
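The stationary points of equation (11) and the logistic path of an unharvested tree can be reproduced with a few lines of code. The following is a minimal sketch: the parameter values and the simple forward-Euler integrator are assumptions made for illustration, not part of the paper.

```python
def logistic_growth(K, a, b):
    """Right-hand side of equation (11): a*K - b*K^2."""
    return a * K - b * K ** 2


def simulate(K0, a, b, dt=0.01, steps=5000):
    """Forward-Euler integration of dK/dt = a*K - b*K^2 (illustrative scheme)."""
    K = K0
    for _ in range(steps):
        K += dt * logistic_growth(K, a, b)
    return K


a, b = 1.0, 0.5                    # hypothetical parameter values
stable_point = a / b               # stationary point K = a/b
peak_growth_stock = a / (2 * b)    # stock at which F(K) is largest

# A "seedling" start converges to a/b; a start at exactly zero stays at zero.
K_from_seedling = simulate(K0=0.01, a=a, b=b)
K_from_zero = simulate(K0=0.0, a=a, b=b)
print(stable_point, K_from_seedling, K_from_zero)
```

Any strictly positive starting stock ends up at `a / b`, which is what a single basin of attraction means here; only the knife-edge start at zero remains at the unstable stationary point.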



4.3. CONVEX-CONCAVE F 

The famous Bretherton-Holt regeneration function for fish biomass takes the form
of equation (11). However, that $F(K)$ may not be concave was realized many years
ago by fisheries experts. Denoting the biomass of a single-species fishery by $K$,
it is commonly assumed in fisheries economics that $F(K)$ is convex-concave, that
is, $F(K)$ is convex at low values of $K$, but concave beyond some value of $K$. A
rigorous analysis of the optimal management of a convex-concave resource was,
however, not provided until Skiba (1978). As we shall see presently, several of
the contributions in this Symposium are economic analyses of ecosystems whose
natural regeneration functions ($F(K)$ in our notation) are convex-concave.

The problem is that, as $F$ is convex-concave, feasible programmes do not constitute
a convex set even if catch is freely disposable. The price system is therefore
generally not viable for implementing the optimum programme, the point with
which we began this Introduction.
5. Micro Foundations of Emergent Macro Properties

Thus far we have reviewed the macro properties of some very special ecosystems. 
In a wide ranging discussion, Chave and Levin (this issue) study certain emergent 
properties of systems that are governed by processes operating at different scales 
of activity. Exploring both spatial and temporal scales, the authors show how 
natural scientists have tried to provide the micro foundations of the macro proper- 
ties of natural adaptive systems. Chave and Levin offer a rich set of examples, 
drawn from ecology, geology, and condensed matter physics, to illustrate common 
traits among adaptive systems. Of particular interest are systems that are scale 
invariant (or “self similar”). The authors note that they display power laws. The 
relationship between metabolic rate and body mass is one example of a power law, 
the allometric dependency of species number on area is another. As the proof that 
scale invariant systems display power laws is simple and intuitive, we offer it here 
for completeness. 

Suppose that two scalar quantities, $x$ and $y$ (both non-negative), are related by a
scale invariant function,

$$y = f(x). \tag{12}$$

In order to change scales, define,

$$z = \alpha y, \text{ and } w = \lambda x, \text{ where } \alpha, \lambda > 0. \tag{13}$$

Since $f$ is scale invariant, it must be that

$$z = A f(w), \text{ where } A\ (= A(\alpha, \lambda)) > 0. \tag{14}$$

But equations (12)-(14) can hold together if and only if $f$ is homogeneous, that is,

$$f(x) = Bx^{\gamma}, \text{ where } B > 0 \text{ and } \gamma \text{ is a constant}. \tag{15}$$

(15) is the implied power function.
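The role of the constant in (14) can be seen numerically. In the sketch below, which is an illustration and not part of the paper's proof, we compute the factor that would be needed to reconcile the rescaled variables at several values of x. For a power function that factor is the same at every x (it depends only on the two scale factors), whereas for a non-homogeneous function such as the exponential it changes with x, so no single constant will do. The names `alpha`, `lam`, `B`, and `gamma`, and their values, are assumptions for the example.

```python
import math


def power_law(x, B=2.0, gamma=0.75):
    """A homogeneous function f(x) = B * x**gamma (illustrative B and gamma)."""
    return B * x ** gamma


def rescaling_factor(f, alpha, lam, x):
    """The factor A that makes alpha * f(x) equal A * f(lam * x) at this x."""
    return alpha * f(x) / f(lam * x)


alpha, lam = 3.0, 10.0             # hypothetical changes of scale for y and x
xs = [0.5, 1.0, 4.0, 25.0]

A_power = [rescaling_factor(power_law, alpha, lam, x) for x in xs]
A_exp = [rescaling_factor(math.exp, alpha, lam, x) for x in xs]

print(A_power)   # the same value at every x
print(A_exp)     # varies with x, so exp is not scale invariant
```

For the power function the common value is `alpha * lam ** (-gamma)`, which is a function of the two scale factors alone, as (14) requires.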



As in economics, it is easier to study the macro dynamics of an ecosystem 
directly without peering at micro foundations. In the four papers that follow the 
Chave-Levin study, the authors investigate the macro dynamics of three types 
of ecosystems: savannahs, shallow lakes, and boreal forests. For tractability, the 
ecosystems are described in terms of differential equations involving only a few 
state variables. Depending on the context, the state variables are interpreted as 
resource stocks or as environmental qualities. Humanity is taken to be the user of 
the ecosystem. The models are deterministic. 
6. Mixed Tree-Grass Ecosystems

Scholes (this issue) explains why the functional relationship between tree cover
and grass cover in the African savannahs is convex for much of its range. Starting
at zero, if tree cover were to increase, grass cover would decline, but at a rate that
would define a convex function (at least initially), meaning that the set of grass-tree
configurations circumscribed by the curve is non-convex. Scholes argues that
because the economic value of the savannahs lies mainly in grass products, and
because the main mechanism (fires) controlling the balance between grass and trees
is dependent on grass, the convex functional relationship between trees and grass
results in the simplest cases in two stable savannah configurations: thin tree cover
and dense tree cover. In such cases the savannahs possess two basins of attraction.
A landscape of thin tree cover is able to support grazing, but one that has dense
tree cover is not able to do so.

The existence of multiple basins of attraction in non-convex systems is a
recurrent theme of this Symposium. While Scholes offers an explanation for the
observed landscapes of the savannahs (expressed in terms of trees and grass and
the ecological pathways mediating their mix), the remaining contributions study
Human-Nature interactions in terms of the way Nature is (or, alternatively, should
be) managed by Human communities. In each of the systems studied here, every
basin of attraction is attracted to an equilibrium point of the system.

7. Human Intervention in Non-Convex Ecosystems

Human intervention in an ecosystem takes the form of resource extraction or 
pollutant discharge. In positive as opposed to normative analysis the intervention is
taken to be given; its consequences for the ecosystem are then studied. In theoretical
work the intervention is frequently taken to be constant over time, which means 
that it can be regarded as a parameter of the ecosystem. We illustrate this by 
considering, in turn, resource extraction and pollution discharge in the context of 
two simple models. 

7.1. CONVEX-CONCAVE GROWTH FUNCTIONS: RESOURCE EXTRACTION 

Dasgupta (1982, ch. 6) constructed a model of an open access fishery. Denoting
fish biomass by $K$, he assumed that the fishery’s growth function, $F(K)$, is

$$F(K) = \begin{cases} -a + bK - cK^2, & \text{if } K > 0, \\ 0, & \text{if } K = 0, \end{cases} \qquad a, b, c, (b^2 - 4ac) > 0. \tag{16}$$

$F(K)$, so defined, is convex-concave. (To confirm this, observe that $F(K)$ is
discontinuous at $K = 0$.) Notice also that $F(K) = 0$ at three values: 0,
$K_1 = [b - (b^2 - 4ac)^{1/2}]/2c$, and $K_2 = [b + (b^2 - 4ac)^{1/2}]/2c$. $K_1$ is a threshold: if
biomass were to fall below it, the fishery would die.
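The threshold property of the growth function in (16) is easy to verify for particular numbers. The sketch below computes the two positive roots of the quadratic branch and checks the sign of growth on either side of the lower root; the parameter values are hypothetical and were chosen only so that the discriminant is positive.

```python
import math


def growth(K, a, b, c):
    """Growth function of (16): zero at K = 0, quadratic for K > 0."""
    return 0.0 if K == 0 else -a + b * K - c * K ** 2


def positive_roots(a, b, c):
    """The two positive zeros of the quadratic branch (requires b^2 - 4ac > 0)."""
    root = math.sqrt(b ** 2 - 4 * a * c)
    return (b - root) / (2 * c), (b + root) / (2 * c)


a, b, c = 1.0, 3.0, 1.0            # hypothetical values with b^2 - 4ac > 0
K1, K2 = positive_roots(a, b, c)

print(K1, K2)
print(growth(0.5 * K1, a, b, c))         # negative: stock below the threshold shrinks
print(growth(0.5 * (K1 + K2), a, b, c))  # positive: stock between the roots grows
```

Below the lower root growth is negative, so an unharvested stock that starts there declines; between the two roots growth is positive and the stock rises toward the upper root.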
If $C$ is catch, the cost of harvest is taken to be $\beta C^{\mu} K^{-\gamma}$, where $\mu > 1$ and $\beta$,
$\gamma > 0$. The idea here is that, while unit cost of harvest is an increasing function of
harvest (there is crowding among fishermen), harvest cost is a decreasing function
of stock (search costs decline if fish biomass increases).

Free entry (and exit) into the fishery is assumed to be an instantaneous adjustment
process, resulting in zero profit at all times. This means that if $p$ is the market
price of a unit of fish biomass,

$$p = \beta C^{\mu - 1} K^{-\gamma}. \tag{17}$$

Writing $q = p/\beta$ and using (16) and (17), the net growth rate of fish biomass (for
$K_t > 0$) is

$$\frac{dK_t}{dt} = (-a + bK_t - cK_t^2) - q^{1/(\mu - 1)} K_t^{\gamma/(\mu - 1)}. \tag{18}$$

$a$, $b$, $c$, $\mu$, $\gamma$, and $q$ are parameters of the dynamical system. Since $q$ reflects the
fish market and an aspect of fishing technology, we shall vary $q$ so as to study the
qualitative properties of equation (18).

Figure 1 is based on the right hand side of equation (18). We have drawn the
figure for the case where $\gamma > (\mu - 1)$. Observe that there is a critical value of
$q$, call it $q^*$, such that if $q = q^*$, the curves $(-a + bK - cK^2)$ and $q^{1/(\mu - 1)} K^{\gamma/(\mu - 1)}$ are
tangent to each other.

If q > q*, equation (18), allied to the second part of (16), has a unique stationary
point: K = 0. This means that the fishery has a single basin of attraction and
is doomed: there is overfishing to such an extent that, in finite time, the fishery
becomes extinct. This is the case advanced by Hardin (1968), who claimed as a
universal rule that freedom in the commons spells ruin. But the thesis is correct
only if, relative to the harvesting technology, the resource is valuable (q > q*).
Even under open access, matters are different when q < q*.

To confirm this, notice that if q < q*, the fishery has three stationary points, 
namely, K = 0 and the two stationary solutions of equation (18), which we have 
denoted as K* and K** in Figure 1. It is simple to infer from the figure that K = 
0 and K = K** are (locally) stable, while K* is unstable. K* is thus a separatrix: 
it separates the two basins of attraction, each of which contains a stationary point 
(K = 0 and K = K**, respectively). We conclude that if $K_0$ > K*, the fishery survives
despite free entry. Hardin’s analysis turns out to have been wrong for this case: if 
the value of fish relative to harvesting costs is low, there is not enough entry to 
pose a danger to the fishery. Because there is a qualitative break in the dynamical 
properties of the fishery under free entry at q = q*, q* is called a bifurcation point 
of the system. 
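The bifurcation at q* can be located numerically. The sketch below rests on an assumption about the form of the harvest term: following the tangency condition described above, open-access catch is taken to be q raised to the power 1/(mu - 1) times K raised to the power gamma/(mu - 1). All parameter values are hypothetical. With mu = 2 and gamma = 2 the net growth rate is the quadratic -a + bK - (c + q)K^2, so for a = 1, b = 3, c = 1 the critical value can be checked by hand: the discriminant 9 - 4(1 + q) vanishes at q = 1.25.

```python
def net_growth(K, q, a=1.0, b=3.0, c=1.0, mu=2.0, gamma=2.0):
    """Natural growth minus open-access catch, for K > 0 (assumed functional form)."""
    catch = q ** (1.0 / (mu - 1.0)) * K ** (gamma / (mu - 1.0))
    return (-a + b * K - c * K ** 2) - catch


def interior_stationary_points(q, K_max=5.0, n=5000):
    """Find zeros of net_growth on a grid over (0, K_max], refined by bisection."""
    points = []
    step = K_max / n
    for i in range(1, n):
        lo, hi = i * step, (i + 1) * step
        f_lo, f_hi = net_growth(lo, q), net_growth(hi, q)
        if f_lo == 0.0:
            points.append(lo)           # a zero landed exactly on a grid point
        elif f_lo * f_hi < 0:
            for _ in range(60):         # bisection inside the bracketing interval
                mid = 0.5 * (lo + hi)
                if net_growth(lo, q) * net_growth(mid, q) <= 0:
                    hi = mid
                else:
                    lo = mid
            points.append(0.5 * (lo + hi))
    return points


def critical_q(q_lo=0.1, q_hi=5.0, iterations=40):
    """Bisect on q between a value with interior stationary points and one without."""
    for _ in range(iterations):
        q_mid = 0.5 * (q_lo + q_hi)
        if interior_stationary_points(q_mid):
            q_lo = q_mid
        else:
            q_hi = q_mid
    return 0.5 * (q_lo + q_hi)


K_star, K_double_star = interior_stationary_points(0.8)   # q below the critical value
print(K_star, K_double_star)
print(interior_stationary_points(1.5))                    # q above it: no interior points
print(critical_q())
```

Between the two interior stationary points net growth is positive and below the smaller one it is negative, which is why the smaller point acts as the separatrix and the larger one is locally stable.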

7.2. CONVEX-CONCAVE POLLUTION RECYCLING FUNCTIONS 
7.2.1. The model 

We now study a pollution problem that has been much analysed in recent years: 
phosphorus discharge into a shallow, fresh water lake. We use the occasion also to 
develop a number of additional notions. 

Consider a system whose dynamics are described by equation (10). However,
so as to make contact with the papers by Brock and Starrett and by Mäler,
Xepapadeas, and de Zeeuw in this Symposium, we now interpret $K_t$ ($\ge 0$) to be
the stock of a pollutant in an ecosystem and $C_t$ ($\ge 0$) the pollution discharge into
the system. $K$ can be taken to reflect the state of the ecosystem.

It would be absurd to assume that pollutants can be disposed of freely. Kneese,
Ayres, and d’Arge (1972) noted that mass has to be conserved, so that what is
produced by society must eventually find a place of residence, in one form or
another. This they formalized by requiring that economic activities must satisfy
a “materials-balance condition”. In the present case, equation (10) is an equation
(unlike equation (1b), not a weak inequality), because pollution cannot be disposed
of freely.

Assume that $C_t = C$, a constant. $C$ is the human intervention. The pollution’s
dynamics are therefore given by the equation,

$$\frac{dK_t}{dt} = F(K_t) + C, \quad \text{for } t \ge 0, \text{ and } K_0\ (> 0) \text{ given}. \tag{19}$$

$F(K)$ represents the net natural regeneration rate of the pollutant. One of the
services ecosystems provide us with is the breaking down of pollutants. Organic
pollutants are decomposed by microbes, atmospheric carbon dioxide gets absorbed
by the oceans, and so forth. Such natural forms of decay are included in $F$. But there
are cases (as with phosphorus in shallow lakes; see below) where the pollutant
has a limited ability to grow as well. So, in what follows we assume that (i) $F$ is
differentiable everywhere, (ii) $F(0) = 0$, (iii) $F(K) > 0$ for some $K$, and (iv) $F(K) < 0$
for all $K$ sufficiently large.

The term resilience refers to the stability of ecosystems. It has been used alter- 
natively to denote the size of the basin of attraction in which it currently resides and 
the speed of convergence back to its current state following a perturbation to it. In 
order to explore such problems, it pays first to study the dynamics of the ecosystem 
when C = 0. In this case, 

$$\frac{dK_t}{dt} = F(K_t), \quad \text{for } t \ge 0, \text{ and } K_0\ (> 0) \text{ given}. \tag{20}$$

K = 0 is a stationary point of the system. How many other stationary points does 
the system possess? 

If F is concave, there is precisely one other stationary point, and it is stable, 
while K = 0 is unstable. But if F is not concave, equation (20) can have any number 
of stationary points. In view of the conditions we have imposed on F, the function 
cannot be convex everywhere. So we look for convex-concave forms. 

Consider the simplest convex-concave form, namely,

$$F(K) = \frac{bK^2}{1 + K^2} - \lambda K, \quad \text{where } b, \lambda > 0. \tag{21}$$

The positive feedback in the regeneration process governing $K$ is given by the
first term on the right hand side (RHS) of equation (21), which, as can be readily
confirmed, is convex-concave, with an upper bound of $b$. The second term on
the RHS of (21) is the rate at which the ecosystem is cleansed of the pollutant.
Combining (19) and (21) we have

$$\frac{dK_t}{dt} = C + \frac{bK_t^2}{1 + K_t^2} - \lambda K_t, \quad K_0\ (> 0) \text{ given}. \tag{22}$$
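That the feedback term in (21) is convex-concave and bounded above by b can be confirmed with a finite-difference check. The sketch below is illustrative: it takes the feedback term to be b times K squared over one plus K squared, sets b = 1 as an assumption, and uses the fact that the second derivative of this term, 2b(1 - 3K^2)/(1 + K^2)^3, changes sign at K equal to one over the square root of three.

```python
import math


def feedback(K, b=1.0):
    """Positive-feedback term of (21): b * K^2 / (1 + K^2)."""
    return b * K ** 2 / (1 + K ** 2)


def second_difference(f, K, h=1e-4):
    """Central finite-difference estimate of the second derivative of f at K."""
    return (f(K + h) - 2 * f(K) + f(K - h)) / h ** 2


inflection = 1 / math.sqrt(3)      # where the curvature of the feedback term changes sign

print(second_difference(feedback, 0.2))         # positive: convex at low stocks
print(second_difference(feedback, 1.5))         # negative: concave at high stocks
print(second_difference(feedback, inflection))  # approximately zero
print(feedback(1e6))                            # approaches, but stays below, b = 1
```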

Equation (22) contains three parameters: $C$, $b$, and $\lambda$. We wish to know how the
ecosystem’s character depends on them. One expects that mostly the global properties
of the ecosystem would vary continuously with the parameters. One should
also expect that there are manifolds partitioning the parameter space into regions,
such that the ecosystem’s structure is the same at every point in any given region,
but differs from the structure in the region adjacent to it. Such manifolds are said
to “bifurcate” the system’s properties. To study the bifurcations, we take $b$ and $\lambda$ to
be given and we vary $C$. The reason we permit $C$ to vary is that $C$ denotes human
intervention and we could in principle control it.

So consider the equation 

bKV(l + K^) = AK. (23) 

Real solutions of equation (23) are the stationary points of equation (22) with C = 0. 

We begin by assuming that λ/b > 1/2, which is to say that the pollutant decays
rapidly. In this case (23) has only one real solution: it is K = 0. Simple graphics
(Figure 2) confirm, however, that there are values of C for which equation (22)
has three (real) stationary points. Assuming one such value, we label the stationary
points as K_1 < K_2 < K_3, respectively. K_2 is unstable, while K_1 and K_3 are
locally stable. K_2 is the separatrix of the system - the point that separates the two
basins of attraction of the ecosystem.
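
The count and stability of the stationary points of equation (22) can be explored numerically. The sketch below is only illustrative: the parameter values b = 1 and λ = 0.6 (so that λ/b > 1/2) and the function names are assumptions made here, not values taken from the paper. It scans for sign changes of the right-hand side of (22) and classifies each root by the sign of the derivative there.

```python
def rate(K, C, b, lam):
    """Right-hand side of equation (22): dK/dt for a constant inflow C."""
    return C + b * K * K / (1.0 + K * K) - lam * K


def stationary_points(C, b, lam, grid=20000):
    """Non-negative stationary points of (22), each with a stability label.

    The rate is negative for K > (C + b)/lam, so the search interval
    [0, (C + b)/lam + 1] contains every stationary point.
    """
    K_max = (C + b) / lam + 1.0
    step = K_max / grid
    points = []
    for i in range(grid):
        lo, hi = i * step, (i + 1) * step
        if rate(lo, C, b, lam) == 0.0:
            root = lo
        elif rate(lo, C, b, lam) * rate(hi, C, b, lam) < 0.0:
            for _ in range(60):  # plain bisection on the bracketing interval
                mid = 0.5 * (lo + hi)
                if rate(lo, C, b, lam) * rate(mid, C, b, lam) <= 0.0:
                    hi = mid
                else:
                    lo = mid
            root = 0.5 * (lo + hi)
        else:
            continue
        # A stationary point is locally stable if the rate is decreasing there.
        slope = 2.0 * b * root / (1.0 + root * root) ** 2 - lam
        points.append((root, "stable" if slope < 0.0 else "unstable"))
    return points


if __name__ == "__main__":
    for C in (0.05, 0.095, 0.20):
        print(C, stationary_points(C, b=1.0, lam=0.6))
```

With these illustrative values, an intermediate inflow yields the pattern described in the text (stable, unstable, stable), while low and high inflows each yield a single stable point.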

7.2.2. Ecosystem flips

Continuing to hold b and λ constant, let us now reduce C. It is simple to confirm
visually that the unstable stationary point (continue to label it K_2) and the larger of
the two locally stable stationary points (continue to label it K_3) get closer to each
other continuously. It is simple to confirm as well that there is a critical value of
C, call it C*, for which K_2 and K_3 coincide to form a point that is stable from the
right, but unstable from the left. C* is a bifurcation point of the system: if C <
C*, the ecosystem possesses a unique (stable) stationary point, whereas if C > C*
(but C < C**; see below), it possesses three stationary points. In short, the system’s
structure changes discontinuously at C*.^25

In contrast, suppose C were to increase. It is simple to confirm visually
(Figure 2) that the unstable stationary point (continue to label it K_2) and the smaller
of the two locally stable stationary points (continue to label it K_1) would get closer
to each other continuously, until, at a critical value of C, call it C**, the two would
coincide, to form a point that is unstable from the right, but stable from the left.
C** is another bifurcation point of the system: if C > C**, the ecosystem possesses
a unique (stable) stationary point, whereas if C < C** (but C > C*), it possesses
three stationary points.

Figure 3. Ecosystem dynamics for constant pollutant inflow: the reversible case.



In Figure 3 we have drawn the equilibrium values of K as a correspondence of C
for a given pair of values of b and λ. Equilibrium K is unique when C < C*. For C
in the interval [C*, C**], the curve depicting K as a correspondence of C bends back 
and then back again, to reflect the fact that equation (22) possesses three stationary 
points. The two upward sloping portions of the correspondence consist of (locally) 
stable stationary values of K, whereas the downward sloping portion consists of 
unstable stationary points. 
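
The two bifurcation values C* and C** can be located numerically as the turning points of the equilibrium correspondence. The sketch below rests on a side calculation that is not in the original text: setting dK/dt = 0 in (22) gives C = λK - bK^2/(1 + K^2), and the folds are where the derivative of this expression with respect to K vanishes, that is, where 2bK/(1 + K^2)^2 = λ. Since the left-hand side peaks at K = 1/√3 with value (3√3/8)b, two folds exist only if λ/b < 3√3/8 (about 0.65). Function names and parameter values are illustrative assumptions.

```python
import math


def feedback_slope(K, b):
    """Derivative of the feedback term b*K^2/(1 + K^2)."""
    return 2.0 * b * K / (1.0 + K * K) ** 2


def equilibrium_inflow(K, b, lam):
    """Inflow C at which K is a stationary point of equation (22)."""
    return lam * K - b * K * K / (1.0 + K * K)


def bisect(func, lo, hi, iterations=200):
    """Root of func on [lo, hi], assuming func changes sign on the interval."""
    f_lo = func(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def fold_points(b, lam):
    """Return (C_star, C_double_star), or None if the curve has no folds."""
    K_peak = 1.0 / math.sqrt(3.0)  # where the feedback slope is largest
    if lam >= feedback_slope(K_peak, b):
        return None

    def gap(K):
        return feedback_slope(K, b) - lam

    K_low = bisect(gap, 0.0, K_peak)  # K_1 and K_2 merge here (C = C**)
    hi = 2.0 * K_peak
    while gap(hi) > 0:  # widen the bracket until the slope falls below lam
        hi *= 2.0
    K_high = bisect(gap, K_peak, hi)  # K_2 and K_3 merge here (C = C*)
    return equilibrium_inflow(K_high, b, lam), equilibrium_inflow(K_low, b, lam)


if __name__ == "__main__":
    print(fold_points(1.0, 0.6))   # reversible case: both folds at positive C
    print(fold_points(1.0, 0.45))  # slow decay: C* is negative
    print(fold_points(1.0, 0.7))   # no folds: unique equilibrium for every C
```

A negative C* signals that the upper arm of the curve reaches the vertical axis before it turns, which is the situation taken up in Section 7.2.4.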

We now conduct a thought experiment. Begin in a situation where C < C*. We 
know that equilibrium K is small. We would like to discover how the system would 
change if C were to increase in a predictable way. Rather than try to integrate 
equation (22), we simplify by imagining that C increases slowly relative to the
speed of adjustment of K_t. By “slowly” we mean that at each C the ecosystem
is able to equilibrate itself. If C were to increase under such conditions, K would 
increase continuously along the lower arm of the curve until C = C**, at which 
point equilibrium K would “flip” to the upper arm of the curve. The ecosystem 
therefore undergoes a discrete change at C**. Further increases in C would lead to 
a continual increase in K along the upper arm of the curve in Figure 3. 

Ecosystem flips have been observed many times and at many scales. Shallow
lakes have been known to flip from clear to turbid water in a matter of months,
village tanks in a matter of weeks, garden ponds in a matter of hours. Insect
populations have been known to crash or explode in a matter of days. Larger
ecosystems generally take longer to flip at their bifurcation points because the
underlying processes operate over greater distances, and are therefore slower.
Grasslands in sub-Saharan Africa can take more than a decade to change into
shrublands. The “salt conveyor” that drives global ocean circulation would probably
take between decades and a century to shut down (or change direction) if the
Greenland ice cover were to melt at rates estimated in current models of global
warming.^26 The fossil record suggests that the interglacials and glacials of ice
ages have appeared only occasionally, but have arrived and departed
“precipitously” - the flips occurring over several thousand years.^27 And so on.

7.2.3. Hysteresis in ecosystem dynamics 

Now suppose we are interested in reversing the process. Start with C > C** and 
reduce it slowly. Figure 3 shows that on the return journey, K declines continuously 
along the upper arm so long as C > C*. This means that for C in the interval [C*, 
C**], K remains higher than it was on the onward journey. To put it another way, 
the ecosystem displays hysteresis. However, at C = C* the ecosystem “flips” to the 
lower arm of the curve in Figure 3. Further declines in K would occur continuously 
if C were reduced further. We conclude that, even though the ecosystem displays 
hysteresis, environmental degradation is reversible: given enough time, K can be 
made to be as small as we like if C were reduced sufficiently. This is the intellec- 
tual basis of the environmental Kuznets curve, mentioned earlier. As we have just 
confirmed, it would be a correct view of future possibilities if the pollutant’s decay 
rate were sufficiently large. 
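
The thought experiment of raising C slowly and then lowering it again can be imitated numerically. The following is a minimal sketch under stated assumptions: b = 1 and λ = 0.6 are illustrative values with λ/b > 1/2, the forward Euler scheme and step sizes are choices made here, and "slowly" is approximated by letting K relax for a fixed time at each value of C.

```python
def rate(K, C, b, lam):
    """Right-hand side of equation (22)."""
    return C + b * K * K / (1.0 + K * K) - lam * K


def relax(K, C, b, lam, dt=0.05, steps=4000):
    """Integrate (22) by forward Euler for a fixed inflow C, starting from K."""
    for _ in range(steps):
        K += dt * rate(K, C, b, lam)
    return K


def sweep(inflows, K_start, b, lam):
    """Step through the inflows in order, letting K settle at each one."""
    path = []
    K = K_start
    for C in inflows:
        K = relax(K, C, b, lam)
        path.append((C, K))
    return path


if __name__ == "__main__":
    b, lam = 1.0, 0.6
    grid = [0.005 * i for i in range(41)]         # C from 0 to 0.2
    up = sweep(grid, 0.0, b, lam)                 # onward journey
    down = sweep(grid[::-1], up[-1][1], b, lam)   # return journey
    for (C, K_up), (_, K_down) in zip(up, reversed(down)):
        print(f"C = {C:.3f}   K rising = {K_up:.3f}   K falling = {K_down:.3f}")
```

In the printed table the two columns differ only for inflows between the two bifurcation values, and they agree again once C has been brought back to zero, which is the sense in which the damage is reversible here.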

7.2.4. Irreversibility in ecosystem damage 

But now consider a less happy possibility. Suppose that λ/b < 1/2, which is to
say that the pollutant decays slowly. In this case equation (23) possesses three
real solutions. One is K = 0, while the other two are positive. Figure 4, which is
the counterpart of Figure 2, depicts this case. We now use Figure 4 to construct
Figure 5, which plots the equilibrium values of K as a correspondence of C. In
contrast to Figure 3, the curve bends backward to cut the vertical axis.

Let us conduct the thought experiment again. Suppose we begin in a situation
where both C and K are low, which means that the system is on the lower arm of
the curve in Figure 5. As C increases, K increases continuously until C reaches
its bifurcation point. At this point the ecosystem flips to a higher value of K.
However, once that happens, the system is incapable of reversing itself. Declines
in C would certainly reduce K, but as Figure 5 shows, even if C were reduced to
zero, the system would remain on the upper arm of the curve, at a higher value of
K than it had to begin with. Not only does the ecosystem suffer from hysteresis, but
environmental degradation is now in addition irreversible: the system is unable to
return to where it had been in the beginning.
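
Why the system stays on the upper arm even at C = 0 can be seen from the positive solutions of equation (23). Dividing (23) by K gives the quadratic λK^2 - bK + λ = 0, which has two positive roots exactly when λ/b < 1/2. The sketch below computes them and checks their stability; the function name and the parameter values are illustrative assumptions.

```python
import math


def zero_inflow_equilibria(b, lam):
    """Positive stationary points of (22) when C = 0, with a stability flag.

    They solve lam*K^2 - b*K + lam = 0, obtained from equation (23) after
    dividing by K. The list is empty when lam/b > 1/2.
    """
    disc = b * b - 4.0 * lam * lam
    if disc < 0.0:
        return []
    roots = [(b - math.sqrt(disc)) / (2.0 * lam),
             (b + math.sqrt(disc)) / (2.0 * lam)]
    result = []
    for K in roots:
        # Stable if the right-hand side of (22) is decreasing at K.
        slope = 2.0 * b * K / (1.0 + K * K) ** 2 - lam
        result.append((K, slope < 0.0))
    return result


if __name__ == "__main__":
    print(zero_inflow_equilibria(1.0, 0.45))  # slow decay: two positive roots
    print(zero_inflow_equilibria(1.0, 0.60))  # rapid decay: none
```

In the slow-decay example the larger root is locally stable, so a system that has flipped to the upper arm settles there when the inflow is shut off rather than returning to K = 0.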

We are used to the intuitive idea that the presence of thresholds in ecosystems
means that large damages to them are irreversible. The above analysis has
shown that large damages can be irreversible even if an ecosystem does not have
thresholds. Positive feedback, allied to a slow decay rate for the pollutant in the
ecosystem, is all that is needed.



8. Optimum Intervention 

Particular interest attaches to optimum intervention. Brock and Starrett interpret K_t
in equation (22) to be phosphorus load in the water column of a shallow lake.
Phosphorus is taken to be a byproduct of agricultural production in the neighbourhood
of the lake. As a flow, it is therefore a “good”. However, as a stock in the lake, it is
a “bad”. The flow of utility is taken to be,

U(C_t, K_t) = log C_t - hK_t^2, where h > 0, (24)

and intertemporal welfare is assumed to be of the form expressed in (9), which for
convenience we rewrite here:

∫_0^∞ U(C_t, K_t) exp(-δt) dt, where δ > 0. (25)

The dynamical system can be expressed as

dK_t/dt = C_t + bK_t^2/(1 + K_t^2) - λK_t, K_0 (> 0) given. (26)

The idea is to maximise (25) subject to equation (26), given that U satisfies (24). 
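
The necessary conditions for this problem can be sketched with a current-value Hamiltonian. The derivation below is a standard application of the Pontryagin Principle to (24)-(26) and is offered as an illustration; the symbols H and μ_t (the co-state variable) are notation introduced here and need not match that of Brock and Starrett.

```latex
\begin{align*}
H &= \log C_t - hK_t^2
     + \mu_t\left[C_t + \frac{bK_t^2}{1+K_t^2} - \lambda K_t\right],\\[4pt]
\frac{\partial H}{\partial C_t} &= \frac{1}{C_t} + \mu_t = 0
     \quad\Longrightarrow\quad C_t = -\frac{1}{\mu_t}
     \qquad (\mu_t < 0 \text{, since the stock is a ``bad''}),\\[4pt]
\frac{d\mu_t}{dt} &= \delta\mu_t - \frac{\partial H}{\partial K_t}
     = \left[\delta + \lambda - \frac{2bK_t}{(1+K_t^2)^2}\right]\mu_t + 2hK_t,\\[4pt]
\frac{dK_t}{dt} &= -\frac{1}{\mu_t} + \frac{bK_t^2}{1+K_t^2} - \lambda K_t.
\end{align*}
% Stationary points: set both time derivatives to zero and eliminate \mu.
% With C = \lambda K - bK^2/(1+K^2) > 0, they satisfy
%   2hK\left[\lambda K - \frac{bK^2}{1+K^2}\right]
%     = \delta + \lambda - \frac{2bK}{(1+K^2)^2}.
```

The last two equations form a pair of differential equations in the state K_t and the co-state μ_t. Because the feedback term is convex-concave, the stationarity condition in the final comment can have more than one solution in K.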

Brock and Starrett show that, for a range of values of δ, h, λ, and b, the pair of
equations (involving the state and the (Pontryagin) co-state variables) that optimum
programmes must necessarily satisfy possesses three stationary points. Two (call
them K_1 and K_3, with K_1 < K_3, corresponding to what could be interpreted to be
the oligotrophic and eutrophic state, respectively) are saddle points, while the
intermediate point (call it K_2) is a spiral source (i.e., it is unstable). They note
that, for any K_0, the two trajectories that asymptote to K_1 and K_3, respectively,
satisfy not only the necessary conditions for optimality, but the transversality
condition as well. This would suggest that, in order to judge which is the correct
trajectory, one would need to compute expression (25) for each trajectory and then
compare the two values - a tiresome requirement. However, the authors show that
there is a value of phosphorus stock, K*, such that if K_0 > K*, the optimum
programme asymptotes to K_3; but that if K_0 < K*, it asymptotes to K_1. In other
words, history matters.^29 They also confirm that if by fluke K_0 = K*, the two
trajectories that asymptote to K_1 and K_3, respectively, are equally desirable. The
authors refer to K* as a Skiba point, in recognition of Skiba (1978), who uncovered
the existence of such points of indifference.

Locating K* is no easy matter. The disturbing conclusion is that if the planner
were by mistake to think that K_0 is greater than K*, the path he would choose
would be wrong: the planner would regard the eutrophic state to be the optimal
target, whereas he ought instead to aim for the oligotrophic state. It would seem
that, in order to determine the optimum programme, there is no alternative to
having to compute the value of expression (25) along both candidate trajectories.

Brock and Starrett also show that there is a range of parameter values for which 
the optimality conditions possess a single stationary point (a saddle point), so that 
there are no Skiba points. The set of parameter values that separate the two regions 
in the space of parameter values defines a bifurcation surface.^30

Mäler, Xepapadeas, and de Zeeuw study a case that parallels the analysis offered
in Section 7.1 (Dasgupta 1982). There are a number of non-cooperative farmers 
discharging phosphorus into a lake. The pollutant in the lake is a public bad, but 
each farmer’s discharge is a private good to that farmer. Assuming that equili- 
brium behaviour yields an open loop solution for the differential game played 
by the farmers, the authors identify cases where the farmers drive the lake to a 
eutrophic state, even though the optimal policy would have been to aim toward 
an oligotrophic state. As noted above, once the eutrophic state is reached, it may 
be impossible to bring the lake back into an oligotrophic state. More generally, if 
environmental problems are neglected over a long period, even with all the good 
will in the world it may prove impossible to reverse the situation subsequently. 



9. Multi-Species Forests

The defining proposition in the economic theory of forestry is Faustmann’s Rule,
which identifies the optimum age at which trees should be harvested in a
single-species forest. Earlier we observed that for deriving Faustmann’s Rule, the
rate at which a synchronised forest’s biomass grows is assumed to be a concave,
quadratic function of that biomass (equation (11)). In multi-species forests,
however, the regeneration rate of a species’ biomass depends not only on its own
biomass, but also on the biomass of other species. For example, when plant species
compete for sunlight and nutrients, the biomass of each species has a negative
effect on the others’ regeneration rates. One conclusion follows immediately:
because of species interdependence, Faustmann’s Rule does not hold.

Forest species are not all plants. Crépin (this issue) studies a model of boreal
forests, consisting of pine, birch, and moose. Moose feed on both pine and birch,
so both tree species have a positive effect on moose biomass. In contrast, moose
biomass has a negative effect on the growth of pine and birch. Given the biomasses
of pine and birch (moose), the regeneration rate of moose (birch) biomass is
assumed to be a concave, quadratic function. In contrast, Crépin argues that when
pine biomass is small, young pines establish better with increasing biomass. So she
assumes that, for any pair of values of moose and birch biomasses, the regeneration
rate of pine is a convex-concave function of its own biomass. Each of the three
species is assumed to have an economic worth at each date, both as harvest and as
stock. The social objective function is the present discounted value of the flow of
economic worth, from the present to the indefinite future.

Technically, Crépin’s problem is considerably more complicated than the single
state-variable model of the shallow lake. Nevertheless, she is able to report substan- 
tial results. She shows that forests may possess multiple basins of attraction, 
meaning that the optimum harvesting policy in a multi-species forest can depend 
on the state at which it is found to begin with. The author derives optimum 
stationary state management rules and characterises the nature of the steady states. 
Faustmann’s Rule holds for neither birch nor pine. 



10. Welfare Economics in Imperfect Economies 

Even in non-convex environments, optimum management policies are known to 
satisfy Pontryagin’s Maximum Principle. Brock-Starrett and Crépin apply the
principle to derive the conditions that optimum programmes must necessarily satisfy.
Brock and Starrett emphasize that, where the Pontryagin necessary conditions for 
optimality possess Skiba points, adding transversality conditions would not suffice. 
What is required in such situations is the sheer brute force of computing social 
welfare along the candidate programmes and then comparing them. 

The Brock-Starrett-Crépin formulations are in the spirit of modern welfare
economic theory, where policy analysis is conducted in the context of a State 
(or regulatory authority) that selects those policies that maximize social welfare 
subject to technological, environmental, and institutional constraints. In modern 
welfare economics, what differentiates the “first-best” from the plethora of 
“second-best” policies it is possible to envisage is the inclusion of institutional 
constraints when deriving the latter. The State is assumed to act on behalf of its 
citizens even in second-best economies. 

One can argue that this is a misleading approach to policy analysis in dysfunc- 
tional societies. It is hard to imagine the sense in which governments in what 
are demonstrably failed states may be said to be optimizing on behalf of their 
citizens. However, it may not be absurd to imagine that even in the most corrupt 
and predatory of governments there are honest people. It can be safely assumed 
that such figures are minor officials, involved in making marginal decisions (a road 
here, a local environmental protection plan there, and so on). What language does
welfare economics have to speak to such people in what we may euphemistically
call imperfect economies?

In an earlier work, Dasgupta (2001b) extended intertemporal welfare economics
to imperfect economies by determining rules that can be used to evaluate small
perturbations to macroeconomic forecasts. A forecast is based on a reading of
technological and environmental possibilities, and on the behaviour of households,
firms, communities, and the State. A perturbation to the forecast is interpreted as
a project (or more generally, a small policy change) under the jurisdiction of the
honest civil servant. Dasgupta showed that the required evaluation rule involves
the use of accounting prices that can in principle be estimated by perturbing the
forecast. The rule itself is to check whether the present discounted value of the
flow of social profits generated by the perturbation is positive. In other words, the
choice criterion is the one that has for long been advocated for social cost-benefit
analysis in optimizing economies. Dasgupta’s analysis did not require the economy
to be convex.

The entire intellectual machinery deployed by Dasgupta (2001b) was taken
from Dasgupta and Mäler (2000), who had developed a criterion for assessing
whether or not intertemporal social welfare is sustained along an economic
forecast. The authors had shown in the context of a simple economic model with
constant population that the same accounting prices as those that can be used in
social cost-benefit analysis can also be used to compute an index for assessing
welfare sustainability. The index in question was shown to be a comprehensive
measure of wealth, or in other words, the social value of the entire stock of the
economy’s capital assets, inclusive of manufactured, human, and natural capital
assets.^32 The Dasgupta-Mäler analysis did not require the economy to be convex.

Arrow, Dasgupta, and Mäler (this issue) extend the findings in the earlier two
papers by answering three related questions in the context of a very general model: 
(1) How should accounting prices be estimated? (2) How should policy changes 
in an imperfect economy be evaluated? (3) How can one check whether intertem- 
poral social welfare will be sustained along a projected economic programme? 
The authors do not presume that the economy is convex. They show that the same 
set of accounting prices should be used both for policy evaluation and for assessing 
whether or not intergenerational social welfare along a given economic path will be 
sustained. They also confirm that a comprehensive measure of wealth, computed 
in terms of the accounting prices, can be used as an index for problems (2) and 
(3) above. They extend the earlier results by including changes in the size of 
population. They also show how uncertainty in future possibilities can be accom- 
modated in the comprehensive measure of wealth. The bulk of their paper contains 
rules for estimating the accounting prices of several specific environmental natural 
resources, transacted in a few well known economic institutions. 



11. Time and Space

The papers included here on the welfare economics of environmental resources 
take time seriously, but not space. In this they reflect the current state of environ- 
mental and resource economics accurately. And yet, the spatial dimension is of 
the utmost importance. At the grandest scale, terrestrial ecosystems differ from 
marine ecosystems. But even within each there is a wide distribution, covering 
height, depth, and spread. Individual members of a population not only interact 
with one another (involving both inter- and intra-species exchanges), but they affect 
and are in turn affected by the abiotic processes at different sites. The evolution 
of landscapes is modulated by such interactions. In any given patch, populations 
breed and die. They also disperse to other sites within and outside the patch. The 
physical, chemical, and biological processes at work operate at different scales,
both spatially and temporally. Together they give rise to spatially structured
populations and landscapes. What we observe is a system that is not only spatially
and temporally patchy, but also modular.^33

Despite the self-evident importance of the spatial aspect of Nature, welfare 
economic theory is not yet spatially sensitive. If a community has to extract 
resources from its local ecosystem, which patches should it harvest, what should 
it harvest, and when? Applied studies inform us of the harvesting strategies rural 
communities in the African savannahs, say, have adopted over time in response to 
the spatial and temporal character of their local ecosystems. Frequently enough,
the rationale behind their strategies has been unearthed as well. But are they optimal
from the community’s point of view? How would we know? As matters stand, 
even the mathematical tools that would be required to address those problems are 
not familiar in the economics literature. It is a potentially very fruitful area for 
future research. 



12. Good and Bad Positive Feedbacks, and Rich and Poor People 

Contemporary models of economic growth are dismissive of the importance of 
Nature. In their extreme form, growth models contain an assumed positive link 
between the creation of ideas (technological progress) and population growth in a 
world where the natural-resource base comprises a fixed, indestructible factor of 
production.^34 The models do involve positive feedback, but of a Panglossian kind.

The error in this literature lies in the fact that Nature is not fixed and
indestructible, but rather, it consists of degradable resources (agricultural soil,
watersheds, fisheries, and sources of fresh water; more generally, ecological
services). It may be sensible to make the wrong assumption for studying a period
when natural-resource constraints did not bite, but it is not sensible when studying
development possibilities today. The latter move is especially suspect when no
grounds are offered by growth theorists for supposing that technological progress
can be depended upon indefinitely to more than substitute for an ever-increasing
loss of the natural-resource base.

In any event, we should be sceptical of a theory that places such an enormous
burden on an experience not much more than two hundred years old. Extrapolation
into the past is a sobering exercise: over the long haul of history (a 5,000-year
stretch, say, up to about two hundred years ago), economic growth even in the
currently-rich countries was for most of the time not much above zero.^35

Positive feedbacks in ecological and metabolic pathways are reasons why the
prospects of economic betterment among the world’s poorest are much harder than
among the rich. For one thing, the poor in a poor economy have to operate on
the boundary of the non-convex region of their nutrition-productivity possibilities,
whereas people who possess sufficient assets are able to get onto the boundary of
their convex region.^36 For another, the non-convexities the poor face can also be a
reflection of their inability to obtain substitutes for depleted natural resources. As
we noted earlier, resource depletion for the poor can be like crossing a threshold:
their room for manoeuvre is circumscribed hugely once they cross. In contrast, the
rich can usually “substitute” their way out of problems.

The simultaneous presence of two types of positive feedback - one enabling 
many to move up in their living standard, the other keeping many others in poverty 
- may explain the large scale persistence of absolute poverty in a world that 
has been growing wealthier on average by substituting manufactured and human 
capital for natural capital. For human well-being, policies matter, as do institutions, 
but the local ecology matters too. If in this Introduction we have focussed on the 
positive feedback mechanisms that operate at the downside of life, it is because 
degradation of the natural-resource base is felt first by the poor, not the rich.



Notes

1. Koopmans (1957) and Debreu (1959) are the classic expositions of this. For completeness, here 
is the definition of convexity of a set: 

A commodity vector, say z, is a convex combination of commodity vectors x and y if z is a
weighted average of x and y, where the weights are non-negative and sum to unity (that is, z =
αx + (1 - α)y for some α ∈ [0, 1]). A set of commodity vectors is said to be convex if every convex
combination of every pair of commodity vectors in the set is in the set. A set is non-convex if it
is not convex.

2. May (1975) and Murray (1993) contain illuminating accounts of both the mathematical and 
ecological reasoning. 

3. See Perrings et al. (1995), Levin and Barrett et al. (1998), and Gunderson and Holling (2002). 
We discuss these issues formally in Section 7. 

4. In an earlier classic, Arrow (1971) had observed that markets for externalities would suffer from
another problem: no matter whether the externalities are positive or negative, the markets would 
be “thin”. 

5. Over the past three decades much work has been done by economists to develop the theory of 
environmental policy. Baumol and Oates (1975) continues to be the outstanding treatise on the 
subject. 

6. See also Cropper and Griffiths (1994) and Grossman and Krueger (1995). 

7. It is, of course, a misnomer. The original Kuznets curve, which was an inverted U, related income 
inequality to real national income per head on the basis of historical cross-country evidence. 

8. In Section 7 we review the mathematics of bifurcations and irreversibilities. 

9. For further discussions of the environmental Kuznets curve, see Arrow and Bolin et al. (1995) 
and the responses it elicited in symposia built round the article in Ecological Economics, 1995, 
15(1); Ecological Applications, 1996, 6(1); and Environment and Development Economics, 
1996, 1(1). See also the special issue of Environment and Development Economics, 1997, 2(4); 
and Dasgupta, Levin and Lubchenco (2000). 

10. WHO (1983, 1985) and Spurr (1988, 1990) are classic publications on the relationship between 
nutritional status and human productivity. For evidence on synergies between nutritional and 
disease status, see Scrimshaw et al. (1968) and Harrison and Waterlow (1990).

11. See the interchange between Johnson (2001) and Dasgupta (2001a) on this. For a more detailed
analysis of the connection between environmental and resource economics and the economics 
of poverty, see Dasgupta (1982, 1993, 2000, 2003). 

12. See, for example, Lutz (1993), Heal (1998), and Weitzman (2000).

13. The meetings were financed by a generous grant from the MacArthur Foundation in Chicago. 

14. Brock and Malliaris (1989) is an excellent treatise in that area. 

15. As is well known, discrete- and continuous-time systems can differ fundamentally in regard to 
their stability properties. The decision to work in continuous time is therefore not an innocuous 
one. As is also well known, the structure of “non-linear” dynamical systems even in continuous 
time is currently understood best when the number of state variables does not exceed 2. Readers 
should note that the analysis that follows is undertaken on one-dimensional natural systems. 
Finally, it goes without saying that restricting the study to deterministic systems is hugely 
limiting. Barring Chave and Levin and the final section of the paper by Arrow, Dasgupta, and
Mäler, all the articles in this issue study deterministic systems. For stability properties of random
dynamical economies in discrete time, see Bhattacharya and Majumdar (2003). 

16. Koopmans (1957) contains an especially thoughtful discussion of “free disposability”. 

17. The Pontryagin co-state variable associated with the optimum is the required accounting 
price. Kamien and Schwartz (1991) contains a good account of dynamic optimization theory. 
Accounting prices are also called shadow prices.

18. See also Keeler, Spence and Zeckhauser (1972). 

19. Johansson and Löfgren (1985) contains a thorough account.

20. May (1989) contains a fine, non-technical exposition of a number of issues raised by Chave and 
Levin. 

21. For a fine study of the economics of chaotic systems, see Majumdar, Mitra and Nishimura (2000). 

22. Each of the additional notions can also be developed by means of the fishery’s model of the 
previous sub-section. 

23. Mäler (1974) contains a general equilibrium analysis of resource allocations subject to the
materials-balance condition. 

24. Close variants of equation (22) have been postulated for a number of natural systems. Here are
four examples:

(1) In order to explain periodic infestations of the spruce budworm in boreal forests, Ludwig,
Jones, and Holling (1978) postulated that the budworm’s population, K_t, changes in accordance
with the equation

dK_t/dt = αK_t - βK_t^2 - bK_t^2/(1 + K_t^2), (α, β, b > 0), (22a)

where the final, forcing term denotes predation by birds. 

(2) The account of the Atlantic thermohaline circulation in Rahmstorf (1995, 2002) can be
formalised in terms of an equation not dissimilar to (22). Temperature and salt gradients across
the North and South Atlantic give rise to the circulation. K_t is taken to be the North Atlantic
deep water flow (travelling south) and C is the amount of fresh water entering, say, the surface
of the North Atlantic (in part from ice melts). Circulation can come to a halt if C is too large.

(3) Scheffer (1997) has used equation (22) to characterise the dynamics of phosphorus deposit
in the water column of shallow lakes when C is the inflow of phosphorus into a lake from
neighbouring farms. The papers by Brock and Starrett, and Mäler, Xepapadeas, and de Zeeuw
(this issue) study that model.

(4) Vegetation cover in the savannahs depends on rainfall, but rainfall in turn depends on
vegetation cover. Denoting rainfall by C_t and vegetation (in biomass) by K_t, suppose, as a first
approximation, that

C_t = αK_t and dK_t/dt = bC_t^2/(1 + C_t^2) - λK_t, (α, b, λ > 0). (22b)

The pair of equations (22a, b) are variants of (22). 

25. Mathematicians call this a “saddle-node bifurcation”. 

26. Rahmstorf (2002).

27. See van Andel (1994) and Rahmstorf (1995).

28. Although, for ease of exposition, we are using the same notation, the points K_1, K_2, and K_3 here
are not the same as the points K_1, K_2, and K_3 in the previous section.

29. Skiba (1978) showed that in non-convex economies the optimality conditions may possess 
multiple stationary points even if the utility function is independent of stocks. 

30. Wagener (2003) has shown that the bifurcation curves in the above system are what mathemati- 
cians call “heteroclinic bifurcations”. 

31. See for example, Pastor and Mladenoff (1992). 

32. Pearce and Atkinson (1993) had earlier proved that an inclusive measure of wealth can be 
used to assess the sustainability of intertemporal welfare in an optimizing world with constant 
population. 

33. Levin (1999) has an excellent non-technical account of this. 

34. Kremer (1993) develops such a model to account for 1 million years of world economic history. 

35. See Fogel (1994, 1999), Johnson (2000), and especially Maddison (2001). The claim holds even 
if the past two hundred years were to be included. The rough calculation is simple enough: 
World per capita output today is about 5,000 US dollars. The World Bank regards one dollar a 
day to be about as bad as it can be: people would not be able to survive on anything much less 
than that. It would then be reasonable to suppose that 2,000 years ago per capita income was not 
less than a dollar a day. So, let us assume that it was a dollar a day. This would mean that per 
capita income 2,000 years ago was about 350 dollars a year. Rounding off numbers, this means,
very roughly speaking, that per capita income has risen about 16 times since then. This in turn
means that world income per head has doubled every 500 years, which in its turn means that the 
average annual rate of growth has been about 0.14 percent per year, a figure not much in excess 
of zero. 

36. Dasgupta and Ray (1986). To illustrate, the undernourished are at a severe disadvantage in their 
ability to obtain food: the quality of work they are able to offer is inadequate for obtaining the 
food they require if they are to improve their nutritional status. Over time undernourishment can 
be both a cause and consequence of someone falling into a poverty trap. Because undernourish- 
ment displays hysteresis, such poverty can be dynastic: once a household falls into a poverty 
trap, it can prove especially hard for descendants to emerge out of it. Many poverty studies
involving econometric exercises (including many that explore nutrition and health) assume linear 
relationships among the stipulated variables. By construction they are incapable of detecting the 
non-convexities inherent in metabolic and ecological processes.

Abstract. We review various aspects of the notion of scale applied to natural systems, in particular
complex adaptive systems. We argue that scaling issues are not only crucial from the standpoint 
of basic science, but also in many applied issues, and discuss tools for detecting and dealing with 
multiple scales, both spatial and temporal. We also suggest that the techniques of statistical mecha- 
nics, which have been successful in describing many emergent patterns in physical systems, can also 
prove useful in the study of complex adaptive systems. 

Key words: criticality, ecology, economy, scale, statistical mechanics 



1. Introduction 

One of the fundamental truths in the study of natural systems is that there is no 
single correct scale on which to study dynamics. In ecological systems, processes
(such as behaviors) at the level of whole organisms represent the collective 
dynamics of cells (such as neurons) and molecules operating on much faster 
time scales, largely shaped by evolutionary processes operating over much longer 
scales of space and time. Similarly, organizations and economies, societies and the 
Internet exhibit collective dynamics that emerge from the behaviors of individual 
agents, mediated through levels of interaction and aggregation. The distinctions 
among scales need not be sharp, as between individuals and their societies, but 
may represent a continuous gradation, as in atmospheric phenomena, across many 
orders of magnitude. It is thus of fundamental importance to recognize how our 
perceptual scales condition the way we describe systems, how patterns change 
across scales, and how phenomena at different scales influence one another (Levin 
1992, 1993).

These pervasive scientific verities become of applied importance when attention 
turns to the interactions among systems with different dominant scales of activity. 
The emergence of antibiotic resistance, for example, has become a problem of 
overwhelming concern because typical bacterial generation times are much shorter 
than those of humans; and what is perhaps more relevant, resistance has arisen
faster than we have been able to produce genuinely novel antibiotics. Other
disease-causing organisms, such as Plasmodium (the etiological agent of malaria),
influenza and HIV have also presented challenges to the development of treatments 
because of similar mismatches in time scales. 

These examples make clear that, although we usually think of evolutionary 
change in biological systems as operating much more slowly than in socioeco- 
nomic ones, the interconnection between biological and socioeconomic systems 
brings these scales much more into concord. Human activities have sped the evolu- 
tion of antibiotic resistance, of pesticide resistance in agricultural systems, and of 
heavy metal tolerance in plants. Harvesting practice, similarly, has influenced the 
life history dynamics of fish species, selecting for those that mature earlier, and at 
smaller sizes. Nor do we need to be reminded that human activities increasingly 
are leading to extinction of species and to changes in the composition of our atmo- 
sphere, again accelerating the scales of environmental change well beyond what 
we have seen in previous centuries of human existence. 

Disease management, global change and conservation of Earth’s biodiversity 
are only some of the spectrum of environmental issues that exist at the interface 
between natural systems and sociopolitical ones. Models can play a fundamental 
role in informing decision-making, because they allow the integration of processes 
operating at diverse scales. However, the art of developing models to act across 
widely different scales of space, time or organizational complexity involves more 
than just the inclusion of every possible detail, at every possible scale. In the study 
of general circulation models, for example, building models with finer and finer 
spatial resolution not only is limited by computational complexity; it also intro- 
duces detail that can confound interpretation. The same applies to the analysis of 
any nonlinear system, in which the challenge in moving across scales is not to 
include as much detail as possible, but to find ways to suppress irrelevant detail 
and simplify system description (Ludwig and Walters 1985; Levin 1991; Levin 
and Pacala 1997). No sensible scientist would try to build a model of the behavior 
of an individual organism by accounting for the processes within every cell, tied 
together in a network of interaction of numbing complexity. Similarly, no sensible 
scientist should try to build models of ecological systems in which one reproduces 
the behaviors of every organism, or even every species. The goal rather should be 
to identify relevant detail, and to describe the dynamics of whole systems in terms 
of the statistical properties of the units that make them up. 

Methods of statistical mechanics permit extrapolation from the microscopic to 
the macroscopic in many physical systems. To describe how water boils or ice 
melts, or how fluid flows, we do not require detailed information on the atomic 
structure of the constitutive particles, or their positions and movements. From the
rules governing the movement of individual particles, we can derive partial differ- 
ential equations describing their collective behaviors. Indeed, such models may 
be mathematically complex, yielding instabilities, chaotic dynamics and nonlinear
pattern formation. Nonetheless, they reliably capture the macroscopic behavior in
terms of the microscopic displacement of the particles.

In biological and socio-economic systems, deriving such macroscopic equations
is also a worthwhile goal, although much more difficult to achieve, despite
recent progress (Durlauf 1993; Flierl et al. 1999). First of all, unlike physical 
systems, the underlying processes are not well-described. Secondly, the range of 
actions of biological and economic agents, involving rational and not-so-rational 
decision-making in the face of complex environmental signals, adds a novel form 
of complexity, beyond the mathematical complexity of relatively simple physical 
systems. Thirdly, and most vexing, the elements that make up biological and socio- 
economic systems cannot be well-represented as uniform ensembles of billiard 
balls. It is the heterogeneity of these systems, and the importance of the continual 
infusion of new types of unpredictable character, that sets these systems apart. 
These characteristics define complex adaptive systems (Levin 1999, 2000), in
which pattern emerges from the interplay between processes that generate novelty
and those that winnow that novelty, based on localized interactions among
self-replicating entities of diverse characters. Economies, societies, ecosystems and the
biosphere are prototypical complex adaptive systems.

Issues of scaling are fundamentally involved in the analysis of complex adaptive
systems, in which macroscopic behaviors of collectives emerge from microscopic
interactions among individual agents. In this paper we will direct attention to such
complex adaptive systems, across a range of applications. Individuals may interact
locally by mechanisms of attraction and repulsion, by exchange of biomass, energy,
chemical compounds, financial currencies or information; or by other forms of
signaling. These processes engender phenomena at higher levels, such as aggregation
into herds or cities, cooperation and warfare, the development of social norms
and endogenous organization into societies and religions.

By focusing attention on emergent phenomena, we seek to explicate the
mechanisms underlying pattern. It might seem that theories developed for billiard
balls might be completely inappropriate as points of departure for describing the
collective dynamics of conscious and rational agents, such as humans. Nonetheless,
the point is that many phenomena arise as statistical regularities, independent
of the fine-scale detail. Collectives then achieve status as super-organisms, whose
behaviors may not be conscious in the whole, but may be described as if they were.
One often hears market experts, for example, report that “the market was nervous
today”, or similar expressions. Indeed, the notion that the market has real ceilings
and floors is not only an anthropomorphism, but is self-fulfilling if the individual
agents that comprise the market believe it. Financial crashes and recoveries, traffic
jams, aggregation in animal societies, are typical of the dynamics found in complex
adaptive systems.

2. Scaling Laws in Natural Systems

A convenient place to begin in the study of the effects of scale is to ask how 
one set of variables changes in relationship to changes in others. In the small, of 
course, that is the stuff of the differential calculus; but scaling laws seek to capture 
those relationships across a range, and in terms of compact analytical relationships. 
Brock (1999) defines a scaling law as a “common property of a set of plots of one 
quantity against another”. He furthermore reminds us that Gell-Mann (1994, p. 97) 
attributes the admission to Mandelbrot that “quite frankly . . . early in his career he 
was successful in part because he placed more emphasis on finding and describing 
the power laws than on trying to explain them”. 

In many applications, the search for scaling relationships has focused on length 
or time scales as the primary variables, and asked how other variables change in 
relation. Mandelbrot (1975) has illustrated the power of such analyses, demon- 
strating for example how the measurement of the perimeter of a land mass, or 
a snowflake, or a cloud may vary in relationship to the length of the measuring 
stick used. Following Mandelbrot (1975), we may ask what the length $L_\varepsilon$ of
a coastline is when measured with a measuring stick of length $\varepsilon$. This length
generally increases steadily as $\varepsilon$ decreases, suggesting that $L_\varepsilon$ only makes sense
if one has specified the scale first. The relation between $L_\varepsilon$ and $\varepsilon$ is of the form
$L_\varepsilon \sim \varepsilon^{1-d_f}$ as $\varepsilon$ tends to zero, which allows us to define the fractal dimension
$d_f = \lim_{\varepsilon \to 0} \left(1 - \ln L_\varepsilon / \ln \varepsilon\right)$ of the set (Hastings and Sugihara 1993). It is possible, and
indeed quite easy, to give rules for the construction of “self-similar” sets, for which
the above scaling law $L_\varepsilon \sim \varepsilon^{1-d_f}$ is in fact valid at all scales. An example is von
Koch’s snowflake (Figure 1, Mandelbrot 1975). The existence of such relationships
implies that geometric concepts such as perimeter may not be absolutes, but make 
sense only relative to the length scale used, so that there are no natural scales 
that must be employed. Such scale-free phenomena are ubiquitous in Nature, and 
beg explanation. They are characteristic of critical phenomena in physical systems, 
but can also arise in other ways, as for example in the development of bronchial 
networks (Barabasi et al. 1996; West et al. 1997). 
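As a minimal sketch of how a fractal dimension can be read off such a scaling law, the following Python snippet computes the perimeter of von Koch's snowflake for successively shorter measuring sticks and estimates the dimension from the slope of log-perimeter against log-stick-length. It assumes the convention, standard for curves, that the measured length scales as the stick length raised to the power 1 - d_f. The function names `koch_length` and `fractal_dimension` are illustrative and are not taken from the original text.

```python
import math


def koch_length(stage, side=1.0):
    """Return (stick length, perimeter) of the von Koch snowflake at a given stage.

    Stage 0 is an equilateral triangle of the given side length. Each
    refinement replaces every segment by 4 segments one third as long.
    """
    n_segments = 3 * 4 ** stage
    stick = side / 3 ** stage          # segment length = measuring-stick length
    return stick, n_segments * stick


def fractal_dimension(stage_a, stage_b):
    """Estimate d_f from L ~ stick**(1 - d_f) using two construction stages."""
    eps_a, len_a = koch_length(stage_a)
    eps_b, len_b = koch_length(stage_b)
    slope = (math.log(len_b) - math.log(len_a)) / (math.log(eps_b) - math.log(eps_a))
    return 1.0 - slope


for stage in range(5):
    eps, length = koch_length(stage)
    print(f"stick = {eps:.4f}   perimeter = {length:.4f}")

print("estimated d_f:", fractal_dimension(2, 6))
print("ln 4 / ln 3  :", math.log(4) / math.log(3))
```

The perimeter grows by a factor 4/3 at every stage while the stick shrinks by a factor 3, so the estimate coincides with ln(4)/ln(3) whichever two stages are compared. For an empirical coastline the same slope would have to be fitted over a finite range of stick lengths.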

In ecological systems, such scaling relationships in regard to length (or area or 
volume) are among the most robust empirical generalizations found. Particularly 
influential have been allometric relationships, which govern the dependence of 
variables such as metabolic rate on body size; species-area curves, which relate 
the number of species found to the area of study; and self-thinning laws, which for 
example relate the total biomass of a forest to the mean size of the trees. 

Figure 1. Two classic examples of fractals. Top: first three stages in the construction of
von Koch’s snowflake fractal (from left to right). The total area increases as the procedure
iterates. Assuming that the first-stage figure has an area $A_0$, the second-stage figure has
an area $A_0 + 3A_1 = A_0 + (3/9)A_0 = (4/3)A_0$. More generally the $n$th-stage figure has an
area $\left(1 + \tfrac{3}{5}\left(1 - (4/9)^{n-1}\right)\right)A_0$, which tends to $(8/5)A_0$ as $n$ goes to infinity. The perimeter,
however, is unbounded, as its length is multiplied by 4/3 from one stage to the next. The fractal
dimension of the von Koch fractal is ln(4)/ln(3). Bottom: fractal structure of the coastline of
Norway.

Figure 2. Metabolic rate versus body mass for mammals and birds, logarithmic axes.
Horizontal axis: body mass (kg). Reproduced from N. Schmidt-Nielsen (1984), with permission.

The relationship between metabolic rate M and body mass B is a well-documented
one, often fitted by a power-law function (Calder 1984; Peters 1983,
see Figure 2):

$$M = aB^{b}$$

When plotted on log-log paper, log M is a linear function of log B, and the straight line has
a slope b. The coefficient a varies across the life forms, and the exponent b between
2/3 and 3/4. The exponent 2/3 can be justified for endothermic animals by the
equilibrium between the thermal energy produced, proportional to the metabolic
rate M, and the energy that can be dissipated, proportional to the body area, which
scales as length squared if shape is preserved. Since the body mass is proportional
to length cubed for constant shape, one should expect $M \sim B^{2/3}$. On the other
hand, arguments have been presented in favor of the 3/4 law for aquatic organisms
(Niklas 1992), and more generally (West et al. 1997). We shall return to these
arguments when we discuss scaling laws from an evolutionary perspective.
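The log-log fit described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the body masses and metabolic rates are synthetic, the values `true_a = 3.4` and `true_b = 0.75` are arbitrary choices made for the example, and `fit_power_law` is a hypothetical helper implementing ordinary least squares on the logarithms.

```python
import math
import random


def fit_power_law(xs, ys):
    """Least-squares fit of log y = log a + b log x; returns (a, b)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mx = sum(lx) / len(lx)
    my = sum(ly) / len(ly)
    sxx = sum((u - mx) ** 2 for u in lx)
    sxy = sum((u - mx) * (v - my) for u, v in zip(lx, ly))
    b = sxy / sxx                      # slope of the straight line on log-log axes
    a = math.exp(my - b * mx)          # intercept, transformed back
    return a, b


rng = random.Random(0)
# Synthetic body masses (kg) spread over several orders of magnitude (assumption).
masses = [10 ** rng.uniform(-2, 3.5) for _ in range(200)]
true_a, true_b = 3.4, 0.75             # illustrative values only
# Multiplicative log-normal scatter around the power law.
rates = [true_a * m ** true_b * math.exp(rng.gauss(0.0, 0.1)) for m in masses]

a_hat, b_hat = fit_power_law(masses, rates)
print(f"fitted coefficient a = {a_hat:.3f}, fitted exponent b = {b_hat:.3f}")
```

Because the fit is linear in the logarithms, the exponent is recovered as the slope. Telling 2/3 from 3/4 in real data needs a wide range of body masses and limited scatter.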

Trees have been much studied in the context of allometric relationships. The
relation between tree height and stem diameter, for example, is mechanically
constrained. Indeed, a tree cannot grow only in height (primary growth), for the
stem would be unable to prevent it from buckling. The maximal height H reached
for a stem diameter D can be found by means of dimensional analysis. The torque
exerted on the tree by wind, or some other destabilizing factor, is proportional to
$H^3$, while the elastic force is proportional to the cross-sectional area of the stem,
$D^2$. Equating these two forces yields $H \sim D^{2/3}$. In competitive environments,
such as tropical rain forests, trees often attempt to maximize their height and their
form factor is only limited by buckling (O’Brien et al. 1995). A similar allometric
relationship has also been found for the largest trees in the U.S. forests (McMahon
1975). Likewise, a relation between tree mass and diameter of the form $B \sim aD^{b}$
has been suggested. While a simple dimensional analysis suggests that b should
be 3, Enquist et al. (1998) have shown that the exponent 8/3 may be more appropriate.
Empirical analysis using more than 1,800 tropical trees yields a value of b =
2.55 (Chave et al. 2001).

Figure 3. Species-area curve for vascular plant species, for 316 islands, regions, or continents
(after Williams 1964; Major 1988; Prance 1978, 1994). Horizontal axis: area (in square meters).
The rightmost point is a recent estimate of the number of vascular plants on Earth
(320,000 species). Note that both axes are logarithmic. A best fit with a power-law function
gives $S = cA^{z}$, with z = 0.23. The apparent convexity of the dataset is probably due to the
under-representation of the tropical biome.

Species-area curves probably are the most famous empirical generalization in 
ecology. It has long been observed that the number of sampled species increases
when the survey area is increased. In the beginning of the twentieth century,
Arrhenius (1921) suggested that the allometric dependency of species number
on area should follow a power law $S \sim cA^{z}$. Since then, an impressive body
of studies has sought to test this relationship in a large variety of systems and
has found values of z ranging between 0.1 and 0.4 (Connor and McCoy 1979;
Rosenzweig 1995, Figure 3). Such power-law relations suggest self-similarity, and
provide attractive generalizations (see for example, Harte et al. 1999). Yet, recent 
studies (Condit et al. 2000; Plotkin et al. 2000; May and Stumpf 2000; Crawley 
and Harral 2001) question the universality of these relations.

In physics, scaling laws have similarly received much attention. Ohm’s law,
which predicts a linear scaling of the electrical potential when the electrical 
intensity through a conductor is varied, has never been rigorously proved, but 
emerged as a robust experimental result during the nineteenth century. The critical 
behavior of physical systems near a transition point has also yielded important 
experimental results. The susceptibility, a measure of how sensitive the system is 
to small perturbations near the transition, scales with the distance to the transi- 
tion. Examples of such systems are water near the boiling point, iron becoming 
spontaneously magnetized, or the transition to superconductivity. 

In economics, the use of scaling laws has also been highly informative, especially
since the work of Zipf (1949). Zipf’s law states that the size of a city is
inversely proportional to its rank (Makse et al. 1995). Other power laws have been
applied to the distribution of the sizes of firms (Gibrat’s law, see Ijiri and Simon
1977; Amaral et al. 1997; Brock 1999), of personal income (Pareto’s law) and of
stock market gyrations. Indeed, the scaling of market fluctuations with the size of
the temporal window is the basis for options pricing. Pareto’s law (Pareto 1896;
see Mandelbrot 1963) states that the density of individuals with incomes greater
than u satisfies the form $f(u) \sim u^{-\alpha}$, where $\alpha$ is a scaling exponent. MacAuley
(1922) criticized Pareto’s generalizations on several grounds, pointing out that the
exponent varies among countries, and disputed its validity for small incomes. Still,
it remains a popular notion.
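To make the role of the scaling exponent concrete, here is a hedged Python sketch that draws synthetic incomes from a Pareto distribution and recovers the exponent by maximum likelihood. It reads Pareto's law as a statement about the fraction of individuals with income above u, which is an interpretive assumption. The minimum income `u_min`, the exponent 1.5 and the helper names are all hypothetical choices made for the example.

```python
import math
import random


def sample_pareto(n, alpha, u_min=1.0, seed=0):
    """Draw n incomes with P(income > u) = (u_min / u)**alpha for u >= u_min."""
    rng = random.Random(seed)
    # Inverse-transform sampling; 1 - random() lies in (0, 1], avoiding division by zero.
    return [u_min * (1.0 - rng.random()) ** (-1.0 / alpha) for _ in range(n)]


def estimate_alpha(incomes, u_min=1.0):
    """Maximum-likelihood estimate of the tail exponent above u_min."""
    logs = [math.log(u / u_min) for u in incomes if u >= u_min]
    return len(logs) / sum(logs)


incomes = sample_pareto(20_000, alpha=1.5)
alpha_hat = estimate_alpha(incomes)
print(f"estimated scaling exponent: {alpha_hat:.3f}")
```

The estimator uses only incomes above the chosen threshold. This is one way to respect the criticism that the law may not hold for small incomes.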

3. Detecting Scales 

Finding the relevant scale of description is an important step in the theoretical 
analysis of a natural system. Usually, only a few scales are relevant, each corre- 
sponding with one process that drives the dynamics of the system. In many cases, 
the investigator must perform rather sophisticated analysis of data to detect the 
relevant scales in the system of study, and we give a few examples of these 
techniques. 

Detecting relevant scales is of crucial importance in areas such as economet- 
rics. If one is able to find the regularities in financial data, an advantage will be 
gained in attempting to predict the future evolution of the market. The science of 
financial markets has witnessed a very rapid development during the 1990s (for 
an introduction, see Mandelbrot 1997; Mills 1999; Bouchaud and Potters 2000). 
Financial markets are a good example of complex adaptive systems, and data are 
widely available. Figure 4 (top, left) displays Standard & Poor’s 500 financial index 
(daily data normalized by the annual average). We use these data to present some 
of the techniques used to detect relevant temporal scales. Clearly, this signal is not 
purely random, as one can see from the histogram (Figure 4, bottom, right), which 
is skewed to the right.

Figure 4. Detecting the scales in the Standard & Poor’s 500 financial daily index, from 1928
to 1997 (data from Mills 1999). Left, top panel: standardized normalized S&P500, $(x - \bar{x})/\bar{x}$,
where $x$ is the daily index and $\bar{x}$ is the annual index (running average). Right, top panel:
Fourier spectrum $S(\omega)$ of the time series. The spectrum decreases with frequency (i.e., large
fluctuations are more rare) like a power law in $\omega$. Left, bottom panel: correlation
function for the temporal signal. The correlation has a characteristic time of about a year.
Right, bottom panel: histogram of the temporal signal. The histogram is obviously skewed
towards the right, and non-normal.



In order to detect temporal scales in this signal, one usually uses the correlation 
function: 

$$C(\tau) = \langle f(t)\, f(t+\tau) \rangle - \langle f(t) \rangle^{2}$$

where the brackets denote the time average. $C(\tau)$ measures how closely the
signal at one instant resembles the signal a time lag $\tau$ later. The function does not
depend upon $t$ if the studied time process is stationary. In Figure 4 (bottom, left), we
have plotted $C(\tau)$ normalized by the variance of $f$, $\langle f(t)^{2} \rangle$, so that $C(0) = 1$. Many
processes have a “finite memory”; that is, the corresponding correlation function
falls off exponentially as $\tau$ goes to infinity. For these types of processes, one defines
the correlation time $T$ as

$$C(\tau) \sim \exp(-\tau/T)$$

For the S&P500 data, we find T = 421 business days. Beyond this time, one can
assume that the events are essentially uncorrelated. For time lags much smaller than
T, on the contrary, one can assume that there has been no loss of memory. For
further discussion of scales in finance, the reader is referred to Mandelbrot (1997).
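The correlation-time estimate can be illustrated on synthetic data. The sketch below does not use the S&P500 series. It assumes a first-order autoregressive signal whose correlation function decays as exp(-lag/T) with T = 20 steps, and it estimates T as the first lag at which the normalized correlation falls below 1/e. The names `ar1_signal`, `autocorrelation` and `correlation_time` are illustrative.

```python
import math
import random


def ar1_signal(n, corr_time, seed=0):
    """Synthetic signal with exponentially decaying memory (an assumption of this sketch)."""
    rng = random.Random(seed)
    rho = math.exp(-1.0 / corr_time)   # one-step correlation
    x, out = 0.0, []
    for _ in range(n):
        x = rho * x + rng.gauss(0.0, 1.0)
        out.append(x)
    return out


def autocorrelation(signal, max_lag):
    """Normalized correlation function C(lag) for lag = 0..max_lag, with C(0) = 1."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [s - mean for s in signal]
    var = sum(c * c for c in centered) / n
    corr = []
    for lag in range(max_lag + 1):
        cov = sum(centered[t] * centered[t + lag] for t in range(n - lag)) / (n - lag)
        corr.append(cov / var)
    return corr


def correlation_time(corr):
    """First lag at which the correlation drops below 1/e, or None if it never does."""
    for lag, c in enumerate(corr):
        if c < 1.0 / math.e:
            return lag
    return None


signal = ar1_signal(20_000, corr_time=20.0)
corr = autocorrelation(signal, max_lag=60)
T_est = correlation_time(corr)
print("estimated correlation time (steps):", T_est)
```

The 1/e crossing is only one possible estimator. Fitting a straight line to the logarithm of the correlation function over small lags is a common alternative.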

Many time series show a certain level of periodicity. This periodicity may be caused by
exogenous forces, such as climate fluctuations, or by biotic fluctuations, such as
prey-predator cycles. An appealing idea is to decompose a signal into weighted
sums of oscillating functions with different periods, for example sine or cosine
functions. This procedure is the Fourier decomposition. A Fourier spectrum
(Figure 4, top, right) gives the relative magnitude of the different frequencies in
a signal. There should be only one peak if the function is a sinusoid. In our
example, we see that the frequencies have the same magnitude from 0.1 to around
0.4 (in year units, 1 yr = 270 business days), and then fall off as a power law. This
suggests a first temporal scale of around 100 business days.
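A small Python sketch of the Fourier decomposition, under explicit assumptions: the signal is a synthetic sinusoid of period 32 samples with added Gaussian noise, and the spectrum is computed with a direct discrete Fourier transform written out with the standard library, not a fast transform. The helper names `power_spectrum` and `dominant_period` are hypothetical.

```python
import cmath
import math
import random


def power_spectrum(signal):
    """Return a list of (frequency, power) pairs for positive frequencies.

    Frequencies are in cycles per sample. The mean is removed first so that
    the zero-frequency component does not dominate.
    """
    n = len(signal)
    mean = sum(signal) / n
    spectrum = []
    for k in range(1, n // 2 + 1):
        coeff = sum((signal[t] - mean) * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))
        spectrum.append((k / n, abs(coeff) ** 2 / n))
    return spectrum


def dominant_period(signal):
    """Period (in samples) of the frequency carrying the largest power."""
    freq, _ = max(power_spectrum(signal), key=lambda fp: fp[1])
    return 1.0 / freq


rng = random.Random(1)
n_samples, period = 512, 32            # illustrative values
signal = [math.sin(2 * math.pi * t / period) + rng.gauss(0.0, 0.5)
          for t in range(n_samples)]

print("dominant period (samples):", dominant_period(signal))
```

As the text notes, a pure sinusoid yields a single peak. Here the noise contributes a low, roughly flat background on which that peak stands out.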

These techniques also apply to spatial patterns. Fourier transforms can be
used for multidimensional data, or along a predefined direction. Spatial correlation
functions are also easily defined. In fact many other techniques have been
developed in the context of pattern recognition and image processing. One of the
most interesting of these techniques is the wavelet transform. The wavelet transform
compares with the Fourier transform in that the data (often multidimensional)
are decomposed into a linear combination of simple orthonormal functions $\phi_i(x)$;
that is, functions such that $\int \phi_i(x)\,\phi_j(x)\,dx = \delta(i - j)$, where $\delta(i) = 0$ if $i \neq 0$
and $\delta(0) = 1$. However, instead of using sine or cosine functions to form an
orthonormal basis as in the Fourier transform, wavelet transforms use functions
parameterized by a scale factor. In other words, while the Fourier spectrum informs
one about the magnitude of a certain frequency in the pattern, the wavelet spectrum
informs about the magnitude of a certain scale in this pattern. Therefore, this is a
very natural tool to detect spatial scales in an image.
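The idea of an orthonormal basis indexed by scale can be sketched with the discrete Haar wavelet. The text does not name a wavelet family, so Haar is an assumption made here because it is the simplest. The code builds the basis for a signal of length 8, checks orthonormality numerically, and reports how much of a signal's energy sits at each scale. The function names are illustrative.

```python
import math


def haar_basis(n):
    """Orthonormal Haar basis for signals of length n (n must be a power of two).

    Returns a list of (scale, vector) pairs. The scale of a wavelet is the width
    of its support; scale 0 labels the constant function.
    """
    basis = [(0, [1.0 / math.sqrt(n)] * n)]
    width = n
    while width >= 2:
        half = width // 2
        norm = 1.0 / math.sqrt(width)
        for start in range(0, n, width):
            vec = [0.0] * n
            for i in range(start, start + half):
                vec[i] = norm          # positive half of the wavelet
            for i in range(start + half, start + width):
                vec[i] = -norm         # negative half of the wavelet
            basis.append((width, vec))
        width = half
    return basis


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def energy_by_scale(signal):
    """Sum of squared wavelet coefficients, grouped by scale."""
    energy = {}
    for scale, vec in haar_basis(len(signal)):
        energy[scale] = energy.get(scale, 0.0) + dot(signal, vec) ** 2
    return energy


# A pattern made of blocks of two samples: all its energy should sit at scale 4.
pattern = [1, 1, -1, -1, 1, 1, -1, -1]
print(energy_by_scale(pattern))
```

The energies over all scales add up to the total energy of the signal, which is the discrete counterpart of the orthonormality condition stated above.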



4. The Adaptive Significance of Scale 

We now turn our attention to the evolutionary consequences of scale in biological
systems. Mechanical and thermal constraints for maintenance of life and
for reproduction of biological organisms are at the core of the theory of natural
selection. D’Arcy Thompson first brought forward the relation between design
and mechanical constraints for organisms. The shape of a bone and the size of a
tree are the result of evolutionary optimization within a game-theoretic framework.
Observed allometric relationships therefore are the products of evolution as well,
and expose implicit tradeoffs and constraints.

Following this idea, authors such as Niklas (1992) and West et al. (1997) take an
engineering approach to explaining observed allometric relationships, with remarkable
success. They hypothesize appropriate constraints and optimization principles
(in terms of efficiency), and from them derive observed relationships. For trees, one




SCALE AND SCALING IN ECOLOGICAL AND ECONOMIC SYSTEMS 



constraint is that the tree should be area-preserving; that is, when it branches, the
cross-sectional area before the branch equals the total cross-sectional area beyond 
the branch (Shinozaki’s pipe model, Shinozaki et al. 1964; Enquist et al. 1999). 
However, such a choice of constraints can be criticized (Horn 2000), and compa- 
rison with real data is often difficult (Schmidt-Nielsen 1984). In addition, as Jacob 
(1977) has pointed out, evolution does not optimize, but tinkers; furthermore, all 
evolution takes place within the context of other types, which argues for game- 
theoretic approaches. Despite these caveats, this theory has been astoundingly 
successful. 



4.1. SCALE AND THE EVOLUTION OF DISPERSAL 

Environments vary and the degree of variation depends upon the temporal and 
spatial window of observation. A wise investor considers this in her strategies, 
and manages portfolios by widening the window of investments to spread risks. 
Evolution takes a similar path to reduce the risks that organisms face, for example 
by spreading risks through dormancy or dispersal. 

The reproduction of plant species is highly limited by the dispersal stage. Plant 
species exhibit a vast spectrum of dispersal strategies to address this problem. 
Some species invest in a few large seeds, dispersed near the parent plant, while
others produce a tremendous number of seeds that can be dispersed far. In a
tropical forest of French Guiana, the smallest seeds are less than a milligram (e.g.,
Cecropia spp., Cecropiaceae), while some are well over a kilogram (Couroupita 
guianensis, Lecythidaceae). The diversity of strategies is evidence of the fact that 
there is no single optimum strategy, but rather that adaptations occur within a 
game-theoretic framework. 

While most propagules are dispersed near the parent plant, a few centimeters 
for weeds to a few meters for trees, some are dispersed far away. Short-distance 
dispersal is mostly driven by gravity. In contrast, long distance dispersal is mostly 
mediated by animals (Fragoso 1997), wind (Nathan et al. 2000) or water (Whittaker
et al. 1989). At the plant scale, the adaptation of the dispersal mode to
environmental constraints is crucial for reproduction. At the stand scale, dispersal 
patterns control the spatial distribution of species and their coexistence (Hubbell 
1997; Pitman et al. 1999; Chave et al. 2002). Short range dispersal is crucial to 
population persistence at this scale. 

At the landscape scale, dispersal controls the dynamics of ecological succession
after major disturbances. For example, after the last glacial stage (18,000
radiocarbon years before present), forest tree species migrated northwards in North 
America. The rate at which this process occurred has puzzled scientists for decades, 
because they assumed that the expansion of the range of a species should occur 
gradually, as a result of multiple short-distance dispersal events (Skellam 1951). 
The rapid spread, however, has its explanation in rare occurrences of long-distance 
dispersal events as evidenced by Clark and collaborators (1998). In fact, the importance
of long distance dispersal had been perceived much earlier for the founding
of new genetically isolated populations (Wright 1943), for the spread of epidemics 
(Mollison 1995), and for the dispersal of pest species (Andow et al. 1990). 
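The contrast between gradual spread by short-distance dispersal and rapid spread driven by rare long-distance events can be illustrated with a toy comparison of two dispersal kernels. This is a sketch under strong assumptions and not a model from the cited studies. Seeds move in one dimension, the thin-tailed kernel is Gaussian, the fat-tailed kernel is Cauchy, and both share the same hypothetical scale of 10 meters.

```python
import math
import random


def gaussian_kernel(rng, scale=10.0):
    """Thin-tailed dispersal distance in meters (assumed kernel)."""
    return rng.gauss(0.0, scale)


def fat_tailed_kernel(rng, scale=10.0):
    """Cauchy-distributed dispersal distance in meters (assumed kernel).

    Sampled by inverse transform: scale * tan(pi * (u - 1/2)).
    """
    return scale * math.tan(math.pi * (rng.random() - 0.5))


def farthest_seed(kernel, n_seeds, rng):
    """Largest absolute displacement among n_seeds independent dispersal events."""
    return max(abs(kernel(rng)) for _ in range(n_seeds))


rng = random.Random(42)
far_gauss = farthest_seed(gaussian_kernel, 10_000, rng)
far_fat = farthest_seed(fat_tailed_kernel, 10_000, rng)
print(f"farthest seed, Gaussian kernel  : {far_gauss:10.1f} m")
print(f"farthest seed, fat-tailed kernel: {far_fat:10.1f} m")
```

With the same typical dispersal distance, the farthest seed under the fat-tailed kernel lands orders of magnitude beyond the farthest seed under the Gaussian kernel. This is the kind of rare event that can set the pace of range expansion.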

At the regional and continental scales, seed dispersal is an important factor 
in determining plant species distributions. For example, many neotropical plant 
species are endemic to a sub-region. Gentry (1992) estimated that, in each of the 
9 phytogeographic regions in Central and South America, plant endemism was 
greater than 50% (except in the coastal zone of Colombia). This endemism cannot 
be explained only by the variety of environmental types, as evidenced by Prance 
in several speciose families such as Chrysobalanaceae (Prance 1973). In principle, 
mathematical models of long distance dispersal allow for dispersal events over 
any scale. However, the frequency of large-scale dispersal events is very low. If 
a seed is dispersed hundreds of kilometers away from any conspecific tree, and 
it does not die in the juvenile stage, this individual will be the founder of a new 
population, genetically isolated from the parent population, and therefore likely to 
initiate speciation (Brown and Lomolino 1998; Nathan 2001). 

Dispersal is a form of exploration conditioned by the scales of variation in 
a system. There is direct analogy here with the investment strategies or the 
degree of risk-taking in the management of organizations. Innovation is advant- 
ageous in a changing environment; how advantageous depends upon the scales of 
environmental variation. 



4.2. SCALE AND MARINE ORGANISMS 

Life in (moving) fluids presents fascinating challenges (Vogel 1994). Swimming 
organisms span a huge range of body sizes, from bacteria to whales. We shall focus 
on the evolutionary significance of this variation in the next section; but for the 
moment, we seek to understand what it means for the organism to live in a moving
environment. 

The dynamics of fluids are fully described by a well-known set of partial differ- 
ential equations, the Navier-Stokes equations (Batchelor 1953). One can accurately 
predict the flux of water transported through a pipe when the water velocity V is
known. Yet, as the velocity increases, the flow becomes turbulent. The reason for 
this transition is that two processes are operating at the same time. At small velo- 
city, the momentum flux carries water in the pipe along linear streamlines (laminar 
flow). At high velocities, however, the laminar flow is destabilized by frictional 
forces. Dimensional analysis of the Navier-Stokes equations, which govern the 
dynamics of the fluid, gives an insight to this problem. To illustrate this, consider 
the Navier-Stokes equation, in one dimension, in the absence of gravitational effect 
or pressure: 



$$\frac{\partial V}{\partial t} + V\,\frac{\partial V}{\partial x} - \nu\,\frac{\partial^{2} V}{\partial x^{2}} = 0$$

in which $t$ is time and $\nu$ is the viscosity (coefficient of friction). This equation
describes the rate of change of velocity, $\partial V/\partial t$, as a function of the spatial profile
of this velocity. The momentum flux is the term $V\,\partial V/\partial x$, and has the dimensions
of $V^{2}/L$, where $L$ is a characteristic length, here the width of the pipe cross-section. If $x$ is
measured in meters and $V$ in meters per second, the frictional term is $\nu\,\partial^{2} V/\partial x^{2}$, which
has the dimensions of $\nu V/L^{2}$ ($\nu$ has units of square meters per second). The ratio of
these two quantities is a basic scaling parameter, the Reynolds number

$$Re = \frac{V^{2}/L}{\nu V/L^{2}} = \frac{VL}{\nu}$$

Re has no dimension. A laminar flow is observed for Re < 2,000 in a typical water
pipe, while turbulence is usually observed for Re > 40,000.
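The Reynolds number is simple enough to compute directly. The sketch below uses the two thresholds quoted above for a typical water pipe and an approximate kinematic viscosity for water at room temperature of about 1e-6 square meters per second. The pipe diameter and velocities are hypothetical values chosen for illustration, and the label "transitional" for the range between the two thresholds is an assumption of this example.

```python
WATER_VISCOSITY = 1.0e-6   # m^2/s, approximate kinematic viscosity of water near 20 C


def reynolds_number(velocity, length, viscosity=WATER_VISCOSITY):
    """Re = V L / nu, with V in m/s, L in m and nu in m^2/s."""
    return velocity * length / viscosity


def flow_regime(re, laminar_max=2_000, turbulent_min=40_000):
    """Classify a pipe flow using the thresholds quoted in the text."""
    if re < laminar_max:
        return "laminar"
    if re > turbulent_min:
        return "turbulent"
    return "transitional"


pipe_diameter = 0.02       # m, hypothetical pipe
for velocity in (0.05, 0.5, 5.0):
    re = reynolds_number(velocity, pipe_diameter)
    print(f"V = {velocity:4.2f} m/s  ->  Re = {re:9.0f}  ({flow_regime(re)})")
```

Because Re is dimensionless, the same classification applies to any consistent set of units. Halving the pipe width while doubling the velocity leaves the regime unchanged.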

The Navier-Stokes equations are simply balance equations. One can incorporate
many other processes that drive the fluid dynamics, such as the change in pressure
(this would be an additional term $(1/\rho)\,\partial P/\partial x$, where $P$ is the pressure and $\rho$ the
density), or the gravity ($-g$, which is the acceleration constant). Therefore, the
relative magnitude of the other processes at work can be summarized into other
non-dimensional numbers, like the Reynolds number.

The analysis of the Navier-Stokes equations in the turbulent regime poses a
formidable mathematical challenge. An intriguing property of the turbulent regime
is that eddies dissipate the energy over a broad spectrum of scales. This is often
described in a temporal context, since air sensors are static. The energy-frequency
spectrum $E(\omega)$ - that is, the energy dissipated for a given frequency $\omega$ - is
(Kolmogorov 1941; Batchelor 1953; Frisch 1995)

$$E(\omega) \sim \omega^{-5/3}$$

This scaling relationship can be translated in a spatial context, since large eddies
are those that are rare (low frequency $\omega$) and that therefore dissipate the largest
amount of energy $E(\omega)$. The Kolmogorov -5/3 scaling law has been observed
in a large number of real fluids. The turbulent regime is chaotic: in a turbulent
fluid, two configurations infinitesimally close initially diverge exponentially fast.
Although the system is governed by deterministic (Navier-Stokes) equations, the
rapid divergence of initially similar states occurs. To further illustrate the difficulty
of analyzing the Navier-Stokes equations, let us mention that the existence
or the non-existence of “blow-up” solutions in the three-dimensional problem
(tornado-like solutions) is still not established, and recent work suggests that
no such singular solution exists (Ya. Sinai, unpublished results). The study of
turbulent fluids is of paramount importance for the current challenges of oceanic
and atmospheric sciences. Indeed, these equations govern the dynamics of both
environments and they constitute the core of most globally coupled models of
ocean-atmosphere interaction (Stull 1988).
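
A scaling exponent such as the -5/3 above is usually read off as the slope of the spectrum on log-log axes. The sketch below illustrates this with an ordinary least-squares fit on synthetic, noise-free data; the function name and the sample frequencies are assumptions made for illustration, and real spectra would require a choice of fitting range and a treatment of noise.

```python
import math


def fit_power_law(xs, ys):
    """Least-squares fit of log(y) = slope * log(x) + intercept.

    Returns (slope, intercept); the slope estimates the power-law exponent.
    """
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic spectrum obeying E(omega) = omega^(-5/3) exactly (illustrative data only).
omegas = [2.0 ** k for k in range(1, 11)]
spectrum = [w ** (-5.0 / 3.0) for w in omegas]

exponent, log_prefactor = fit_power_law(omegas, spectrum)
print(f"fitted exponent = {exponent:.4f}")
```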

Obviously, it makes a big difference to a swimming organism whether the surrounding
environment is turbulent or not. Since both velocity and size
increase with body weight, minute organisms live in a laminar and indeed quite
viscous liquid, while fish live in a turbulent fluid. Sub-millimetric animals displace
themselves using flagella; they use the same technique as snakes on the ground.
This is not to say that these animals cannot move large distances; they can easily
be carried by streams. In contrast, large fish have a highly optimized design. A
most important optimization constraint, in this case, is the minimization of the 
energy needed to displace the fish’s body, a quantity called the drag. This is why 
fish have an aerodynamic shape, and well-designed body surfaces (Vogel 1994). 
Constraints are different for flying animals. Drag is not a crucial issue, because 
of the low density of air, but they must overcome gravity. Flying devices are 
wonderful instances of evolutionary design. The technical solutions are varied, 
from insects propelled by small wings, flapped at high frequency, to large birds, 
which take full advantage of wind upwelling streams. 

So far, we have considered the consequences of living in a moving fluid for 
a single organism; but at the level of a community of organisms, environmental 
constraints also have deep implications. For example, birds often assemble in 
geometric, vee-shaped groups, during their migration flight. This technique reflects 
community-level optimization: birds reduce their energy expense globally by this 
formation. This emergent pattern is therefore intimately related to the physical 
properties of the environment. Numerous examples reinforce this observation for 
other organisms, such as the clustered large-scale distribution of zooplankton in 
the ocean, likely due to a response to physical or chemical conditions (Levin et al. 
1989), or the multi-scale distribution of tree species in tropical rainforests (Plotkin 
et al. 2002). Importantly, the explanation for observed spatial distributions may 
differ across scales, say with exogenous forces dominating on broad scales, and 
endogenous forces on small scales. Spectral analysis shows a -5/3 scaling on broad 
scales, typical of turbulence, but much flatter distributions on finer scales (Levin et 
al. 1989). 

Spatial aggregation is a property of interest not only to ecologists, but to 
economists and other social scientists as well. Recently, Flierl et al. (1999) used 
numerical techniques coupled with methods of statistical mechanics (see section 5) 
to investigate the role of local interaction among organisms as an important cause 
of pattern formation in communities of marine organisms. We return to this in 
section 6. 

5. Power Laws in Physical Systems: How Do They Arise?

A number of theories have attempted to provide a better understanding of the 
conspicuousness of power laws in Nature. Most of them, for example the idea of 
self-organized criticality, are closely related to the fundamental concept of critical 
transitions. 

5.1. CRITICAL SYSTEMS AND RENORMALIZATION

When one heats water to the boiling point, or cools steel down to the Curie tempera- 
ture, the system undergoes a dramatic transition, whence the term “critical”. The 
state of water changes from liquid to gas, whereas steel below its Curie tempera- 
ture has a spontaneous magnetization (Stanley 1971; Yeomans 1992; Binney et al. 
1992). In this case, the transition is from an ordered state (liquid, ferromagnet) to a 
disordered state (gas, paramagnet). In the words of Kauffman (1993), the transition 
point is a “sweet spot” between a too-ordered system and a too-disordered one. 
It has been observed experimentally that, in the vicinity of the transition, many 
physical quantities vary according to a power-law as the parameter varies (Stanley 
1971; Binney et al. 1992). A conceptual breakthrough in the study of these systems 
was the realization that although a vast number of such transitions are observed in 
Nature, they all fall into a few classes, referred to as universality classes. 

The simplest example of critical transition is encountered in porous media or 
fractured rocks (Broadbent and Hammersley 1957). The question is “What is the 
chance that water will percolate through the medium?” Obviously, this question is 
related to the fraction of accessible pores in the medium. To model this system, 
it is helpful to think of a lattice with open and closed bonds. Let us consider a 
taxi driver in Manhattan, New York. He wants to drive from the upper side to 
the lower side, without respect for the one-way signs, and without knowing its 
direction beforehand. However, a fraction $1 - p$ of the streets are jammed. Then, it can be
shown that the taxi driver succeeds in his enterprise if and only if $p > 1/2$ (Kesten
1980). This result actually holds only in the limit of an infinite system size. The
probability of percolation, when $p$ is just slightly larger than 1/2, behaves as
$(p - 1/2)^{\beta}$, where $\beta = 5/36$, and a set of exponents characterizes this transition. At
criticality, that is at $p = 1/2$, the set of occupied bonds is fractal, and the fractal
dimension is $d_f = 91/48$. A very useful approximate scheme makes full use of
this scale invariance: it is the renormalization procedure. Since the system is scale-
invariant, its properties at criticality should not be much changed when the scale
unit is varied. Therefore, let us take a scale twice as large (Reynolds et al. 1977). In
Figure 5, we show how open and closed bonds can be redefined by using a larger
unit of measurement. After the transform, blocks of 2 x 2 bonds are considered
as a single bond, either open (if the underlying 2 x 2 network is open) or closed.
Thus, knowing the fraction of open bonds $p_1$ at scale $k = 1$, we find the fraction
of open bonds $p_2$ at scale $k = 2$. The relationship is in general valid from scale $k$
to scale $k+1$ and it reads

$$p_{k+1} = p_k^5 + 5p_k^4(1 - p_k) + 8p_k^3(1 - p_k)^2 + 2p_k^2(1 - p_k)^3$$

The equation here deals explicitly with scale, for it relates the state of the system at
scale $k$ to its state at scale $k+1$. The key observation, made by Kadanoff (1966), is
that the system at criticality should be scale invariant, so that the critical $p$ should
be a fixed point of the above equation (i.e., a value of $p$ that the transformation
leaves unchanged). Apart from the trivial fixed points $p = 0$ and $p = 1$, the only fixed
point of this equation is at $p = 1/2$. This threshold usually depends on the choice of
the renormalization scheme; the fact that $p = 1/2$ coincides with the exact threshold is
fortuitous. One can also use this equation to estimate a wealth of critical exponents.
Moreover, a fascinating correspondence between the percolation problem and a
model of dynamic critical transition, the Ising model, has been found (Fortuin and
Kasteleyn 1972). The study of the static model of percolation therefore provides
valuable information on the equilibrium state of a dynamical system.

Figure 5. Description of the real-space renormalization procedure for the bond percolation
problem used by Reynolds et al. (1977). Left: step 1 corresponds to a change in scale, from
scale $2^k$ to scale $2^{k+1}$. Step 2 corresponds to the renormalization procedure. During this
procedure, an eight-bond cell is replaced by a two-bond cell. Right: examples of the renormalization
of a few of the possible 8-bond cells. The renormalization rule is identical for the
horizontal bond and for the vertical bond: if there exists an open path from the left to the right
then the renormalized horizontal bond is open.
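
The renormalization map for bond percolation can be explored numerically. The sketch below iterates the polynomial given above, locates its non-trivial fixed point by bisection, and estimates the correlation-length exponent from the slope of the map at that point using the standard real-space rule ν = ln b / ln λ with scale factor b = 2. That rule is a textbook step which is not spelled out in the text, and all function names are illustrative assumptions.

```python
import math


def rg_map(p: float) -> float:
    """One renormalization step: fraction of open bonds at scale k+1 given p at scale k."""
    q = 1.0 - p
    return p**5 + 5 * p**4 * q + 8 * p**3 * q**2 + 2 * p**2 * q**3


def rg_flow(p0: float, steps: int = 60) -> float:
    """Iterate the map; starting values flow to 0 or 1 unless p0 sits on the fixed point."""
    p = p0
    for _ in range(steps):
        p = rg_map(p)
    return p


def nontrivial_fixed_point(lo: float = 0.1, hi: float = 0.9, tol: float = 1e-12) -> float:
    """Bisection on rg_map(p) - p inside (lo, hi), away from the trivial roots 0 and 1."""
    def f(p):
        return rg_map(p) - p
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


p_star = nontrivial_fixed_point()
h = 1e-6
slope = (rg_map(p_star + h) - rg_map(p_star - h)) / (2 * h)  # central difference
nu_estimate = math.log(2.0) / math.log(slope)                 # b = 2 (scale doubled)

print(f"p* = {p_star:.6f}, slope = {slope:.4f}, nu = {nu_estimate:.4f}")
print(f"flow from 0.45 -> {rg_flow(0.45):.3g}, from 0.55 -> {rg_flow(0.55):.3g}")
```

Because the slope at the fixed point exceeds one, starting values on either side of 1/2 are driven apart towards the fully closed or fully open lattice. The exponent obtained this way is only an approximation to the exact two-dimensional value ν = 4/3.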

The techniques of the renormalization group were first developed as a technical
tool in quantum field theory (Bogoliubov and Shirkov 1959), where most
of the theory can be dealt with in an abstract space (Fourier space). In contrast,
Kadanoff’s renormalization is referred to as “real-space” renormalization (Wilson
and Kogut 1974). The notion of critical transitions is of great potential interest, both
in ecology and in economics. In both disciplines, the spatial (or more abstract)
organization of individual agents into interacting networks can determine the
emergence of patterns (cities, aggregations of animals, human societies, trophic
webs) and thereby influence fundamentally the flows (capital, information, energy,
disease). Thus, extensions of the Ising model, such as interacting particle systems,
have become the objects of quantitative study in examining these problems (Durrett
and Levin 1994).

5.2. SELF-ORGANIZED CRITICALITY 

As just discussed, empirical studies have shown that when a parameter (tempera- 
ture, occupation of the percolation lattice, reproduction rate) is tuned near the 
transition point, these systems become insensitive to the scale of study; that is,
they become scale-invariant. Bak, Tang and Wiesenfeld (1988) have suggested that
complex systems of interacting agents “evolve naturally towards a critical state, 
with no intrinsic time or length scale”. For example, a pile of sand constructed by 
slowly adding one grain of sand at a time, eventually reaches an angle for which 
the addition of one grain can cause an avalanche. They have sought to apply this 
idea to systems as diverse as earthquakes (Sornette and Sornette 1989), the species 
extinctions observed in the fossil record (Bak and Sneppen 1993), and forest fires 
(Malamud et al. 1998). Kauffman (1993) has suggested that such concepts should 
also apply to economic and social systems. Ecosystems and societies, however, 
are not sandpiles; they are made up of heterogeneous assemblages of individual 
agents interconnected through irregular networks. Although valuable as a concept, 
self-organized criticality is too simplistic a notion for most natural systems (Levin 
1999). 

5.3. CRITICALITY IN ECOSYSTEMS 

The classical model of the dynamics of a single species is the Verhulst equation 

$$\frac{du(t)}{dt} = b\,u(t) - m\,u(t) - a\,u(t)^2$$

In this, individuals produce offspring at a rate $b$, and die at a rate $m$ ($b > m$).
The population reaches its carrying capacity $u^* = (b - m)/a$, after an initially
exponential growth. Such a model also applies to the growth of the population
of a city. The borderline region where the reproductive ratio $R_0 = b/m$ is close
to one is often observed in real communities. Exactly at $R_0 = 1$, the population
undergoes a critical transition and exhibits non-trivial properties. For example, $P(t)$,
the population’s survival probability, decreases as $P(t) \sim 1/t$. In addition, for the
model on a lattice with limited distance dispersal, $R_0$ should be at least $R_0^c > 1$
(Harris 1974), and it is observed numerically that

$$P(t) \sim t^{-\delta}$$

where $\delta = 0.451$ in two dimensions of space. Note that the non-spatial theory
predicts $\delta = 1$. Also, starting from a small population cluster, the number of individuals
grows over time as $N(t) \sim t^{\theta}$, with $\theta = 0.230$. One may also wonder how
an initially small population cluster spreads over time. The radius $R(t)$ (averaged
over surviving populations) also grows as a power law of the time, $R(t) \sim t^{1/z}$, with
$z = 1.73$ in two dimensions ($z = 2$ in the non-spatial model). The three exponents
$\delta$, $\theta$, and $z$ are independent of each other, and are related to other exponents, such
as the fractal dimension of the set of occupied sites, $d_f = z\theta/2$. A recent review
of this theory is given in Hinrichsen (2000). Note that the simple on-lattice population
model depicted above also undergoes a critical transition around $R_0 = R_0^c$,
belonging to the so-called “directed percolation universality class”. This universality
class is characterized by a set of scaling exponents, among which are $\delta$, $\theta$, and
$z$. Indeed, virtually any quantities satisfy power laws near the critical point, and the
behavior of many quantities in the vicinity of the critical point is given by functions
of $(R_0 - R_0^c)$. These exponents take non-trivial values in low-dimensional
systems (dimension $d = 1$ to $d = 4$), and they are related by a relationship between
exponents, referred to as the hyperscaling relation (Grassberger and de la Torre
1979)




$$\theta + 2\delta = \frac{d}{z}$$
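
Returning to the Verhulst equation that opens this subsection, the following sketch integrates it numerically and compares the result with the carrying capacity u* = (b - m)/a and with the closed-form logistic solution. The parameter values, the fourth-order Runge-Kutta integrator, and the function names are assumptions chosen for illustration only.

```python
import math


def integrate_verhulst(u0, b, m, a, t_end, dt=1e-3):
    """Integrate du/dt = b*u - m*u - a*u^2 with a classical RK4 scheme."""
    def f(u):
        return b * u - m * u - a * u * u

    u = u0
    for _ in range(int(round(t_end / dt))):
        k1 = f(u)
        k2 = f(u + 0.5 * dt * k1)
        k3 = f(u + 0.5 * dt * k2)
        k4 = f(u + dt * k3)
        u += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return u


def verhulst_exact(u0, b, m, a, t):
    """Closed-form logistic solution with growth rate b - m and capacity (b - m)/a."""
    r = b - m
    k = r / a
    return k / (1 + (k / u0 - 1) * math.exp(-r * t))


# Hypothetical parameters: birth rate, death rate, crowding coefficient.
b, m, a = 1.0, 0.5, 0.01
carrying_capacity = (b - m) / a

u_numeric = integrate_verhulst(u0=1.0, b=b, m=m, a=a, t_end=40.0)
u_exact = verhulst_exact(1.0, b, m, a, 40.0)
print(f"u(40) = {u_numeric:.6f}, exact = {u_exact:.6f}, u* = {carrying_capacity}")
```

This mean-field description has no critical exponents of its own beyond the non-spatial values quoted above; the lattice exponents require a spatial, stochastic simulation that is beyond this sketch.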



When more than a single species is present in the community, interactions drive the 
important processes, and shape community-scale patterns. In a real community, 
competition interacts with dispersal and differential mortality patterns in a non- 
trivial way. To test for all possible combinations of processes, factorial experiments 
can in theory be set up, but these are difficult. Models provide an alternative insight 
into this problem. Early models of species coexistence were built upon Lotka- 
Volterra equations, which describe the competition for resources. One of the main 
conclusions of this approach is that two species feeding upon the same resource 
cannot coexist. A number of theories have been offered to explain this apparent 
problem. We have already seen that at the largest scale, dispersal and speciation 
play an important role, together with environmental variations (climate, rainfall, 
matrix type). At such large scales, interspecies competition is irrelevant. It becomes 
important, however, at smaller scales. A simple individual-based model such as the 
one discussed earlier can serve as a basis to address the issue of species richness 
and to provide theoretical predictions for the species-area curves. In the simplest 
model, it can be shown that the species with the largest reproductive ratio always 
drives all other species to extinction. However, if a tradeoff is incorporated, the 
species with largest reproductive ratio also is the poorest competitor. In this case, 
a non-trivial equilibrium is reached (Levins 1969; Buttel et al. 2002; Chave et al. 
2002). Figure 6 shows the spatial arrangement of coexisting species at different 
instants of the simulation. A power-law species-area relationship is observed. We 
observe that this relationship does not come from the critical nature of the system. 
Other scaling laws for abundance distributions have also been derived (e.g., Kinzig 
et al. 1999). 

Figure 6. Output of a spatially-explicit model of biodiversity: the tradeoff model. Each species
possesses a different death rate and the same birth rate. Species with the smallest death rate
are at an advantage over the others. However, these species are poorer competitors: when they
compete for a site with a species with a larger death rate, they always lose. At early times
(t = 16 to t = 64), there is little interspecific competition, and monospecific domains grow as 
in the supercritical contact process. However, at later times, competition becomes important, 
and strategies with a large death rate, but with little habitat limitation, are increasingly often 
observed. 



6. Statistical Mechanics and Kinetic Theory 

The utility of the concepts developed in the previous section makes one wonder
whether these could apply to other systems, which lack such a well-defined mathe- 
matical representation. A crucial remark is that the large-scale patterns observed in 
complex adaptive systems such as markets, social interactions, and ecosystems are 
generally not the mere superposition of an assembly of organisms, but the result of 
interactions among these “agents”. This situation is reminiscent of what happens 
in a gas: each particle performs a complex displacement, changing its velocity and 
orientation during the interactions with other particles. However, the gas as a whole 
has well-understood properties, and these properties are not directly related to the 
history of each particle in the gas. For example, the temperature of a gas is a purely 
macroscopic concept, and it does not apply at the scale of a particle. 

We here briefly sketch the method of statistical mechanics used to find the
macroscopic kinetic equations of an assembly of interacting organisms, and we
shall continue to refer to the example of gas particles. We assume that the particle
is point-like, described by a position vector $x$ and an instantaneous velocity vector $v$.
The elementary displacement of the particle, $\delta x$, depends on the value of the velocity.
In addition, an interaction of the particle with its environment is recorded by
a displacement $(\delta x_e, \delta v_e)$ from the position $(x, v)$. The total microscopic equation
therefore is:



$$\begin{aligned} \delta x &= v\,\delta t + \delta x_e \\ \delta v &= a\,\delta t + \delta v_e \end{aligned} \qquad (1)$$



where $a$ is the acceleration of the particle. If the particle is solitary, Newton’s
equation $F = ma$ (force exerted upon the particle equals the particle’s mass
times its acceleration) closes the description of the system. The external displacements
$(\delta x_e, \delta v_e)$ can be taken as random variables, and we are left with a set
of two coupled stochastic differential equations, called Langevin equations. The
vector $(\delta x_e, \delta v_e)$ contains the information of the action of other particles, while
$F$ describes other forcings (e.g., an electric force, if the particle is charged). The
global description of a large assembly of such particles requires the definition of
a configuration (or microstate). This is just the value of the vectors $(x_i, v_i)$, for all
the particles in the system. If we have $N$ particles, the system has $6N$ free variables,
and a configuration is a point in a $6N$-dimensional space called the phase space.

The heart of the kinetic theory consists in writing down the balance equa- 
tion for the number of particles contained in a small volume of this phase space 
(cf. Appendix). The resulting differential equation can then be transformed into
Boltzmann’s equation by neglecting collision terms $(\delta x_e, \delta v_e)$ in equations (1).

In a more general setting, forms of the Fokker-Planck equation can be deduced 
directly when the particles are considered as random walkers, i.e., when $\delta v_e = 0$.
A large assembly of - interacting or non-interacting - random walkers possesses 
important macroscopic properties. In the non-interacting case, the dynamic equa- 
tion (equation 4, appendix) becomes: 



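$$\frac{\partial n(X, t)}{\partial t} = \frac{\partial^2}{\partial X_\alpha\,\partial X_\beta}\left[ D_{\alpha\beta}\, n(X, t) \right] \qquad (5)$$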



which simply is the diffusion equation. The matrix $D_{\alpha\beta}$ is the covariance matrix
$\langle \delta X_{e\alpha}\,\delta X_{e\beta} \rangle$, also called the diffusion matrix (this reduces to the diffusion
coefficient in one dimension of space).
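
The link between microscopic random displacements and macroscopic diffusion can be checked with a small simulation. The sketch below follows an assembly of non-interacting one-dimensional random walkers and records their mean squared displacement, which should grow linearly in time if the density obeys a diffusion equation. The Gaussian step distribution, the number of walkers, the seed, and the function name are assumptions made for illustration; the constant of proportionality depends on how the diffusion coefficient is normalized and is not asserted here.

```python
import random


def mean_squared_displacement(n_walkers: int, n_steps: int,
                              sigma: float = 1.0, seed: int = 0):
    """Simulate independent 1-D walkers; return the MSD after each time step."""
    rng = random.Random(seed)          # fixed seed so the run is reproducible
    positions = [0.0] * n_walkers
    msd = []
    for _ in range(n_steps):
        for i in range(n_walkers):
            positions[i] += rng.gauss(0.0, sigma)   # random displacement delta x_e
        msd.append(sum(x * x for x in positions) / n_walkers)
    return msd


msd = mean_squared_displacement(n_walkers=2000, n_steps=200)
# msd[t - 1] is the mean squared displacement after t steps.
print(f"MSD(100) = {msd[99]:.1f}, MSD(200) = {msd[199]:.1f}, "
      f"ratio = {msd[199] / msd[99]:.2f}")
```

Doubling the elapsed time should roughly double the mean squared displacement, up to sampling noise that shrinks as the number of walkers grows.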



7. Analysis of Multiple-scale Systems

Interfacing ecological and socioeconomic systems presents fundamental chal- 
lenges because the dominant scales on which these systems operate are so different. 
Indeed, the problems associated with multiple scale systems are not unique to 
the interface: Any ecological system (Levin 1992) or any socioeconomic system 
involves the interplay of processes operating on diverse scales of space, time and
organization. Individuals are organized into families, families into populations, 
populations into species, species into communities, and communities into global 
assemblages. Small-scale processes, operating typically over relatively fast time 
scales, become integrated into larger scale processes, operating over longer time 
scales. Events in ecological time are shaped by evolutionary processes, and in turn 
drive new episodes of evolution. 

These problems are not unique to these systems; scaling problems are among 
the most pervasive in the physical sciences as well. General circulation models, 
which are used to predict climate change and other phenomena, require translating 
small-scale processes into global patterns of circulation, and in turn using those 
models to predict changes at local scales. This leads to the need for multigrid 
methods, with varying degrees of resolution at different spatial (and temporal)
scales. Temporally, the problem of multiple scales also represents an active area
of research. If the scales are sufficiently distinct, so that one may be regarded as 
slow enough that its variables are constant on the fast scale, numerous methods 
exist, such as singular perturbation theory, to deal with the systems. Guckenheimer 
(2000) identifies three complementary approaches to such problems: nonstandard
analysis, “classical” asymptotic methods, and geometric singular perturbation
theory. The development of numerical methods to explore such phenomena is an
active area of research.

When the scales in question are very different, substantial simplification is
possible by treating the variables on the slow scale as constants for the purpose of
the fast-time-scale dynamics, whose asymptotic behavior is then assumed to track
the movement of the slow variables on the slow time scale. In enzyme kinetics,
this “quasi-steady-state” assumption allows one to assume that enzyme-substrate
complexes reach an “equilibrium” concentration, determined by the free concentrations
of enzyme and substrate, thereby reducing the dimensionality of the system
on the fast time scale. Formally, the mathematical justification for such assumptions
is to be found in theorems like those of Tykhonov (1952), but intuitively
the idea is clear: On the fast time scale, the slow variables are treated as constants,
while on the slow time scale, the fast variables are assumed to be in equilibrium.
Through such an assumption, for example, the classical Lotka-Volterra predation
equations (with damping) collapse to a logistic equation for the predator if it is
assumed that resource dynamics are much faster than predator dynamics.
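
The collapse of the damped Lotka-Volterra system to a logistic equation can be written out in a few lines. The derivation below is a sketch under assumed notation that does not appear in the text: x is the resource, y the predator, r the resource growth rate, c the damping (self-limitation) coefficient, a the attack rate, e the conversion efficiency, d the predator death rate, and ε a small parameter expressing that the resource is fast.

```latex
% Assumed model: fast resource x, slow predator y, 0 < \varepsilon \ll 1.
\begin{align*}
\varepsilon\,\frac{dx}{dt} &= x\,(r - c\,x - a\,y), \\
\frac{dy}{dt} &= y\,(e\,a\,x - d).
\end{align*}
% Fast time scale: y is treated as a constant and x relaxes to its quasi-steady state
\[
x^{*}(y) = \frac{r - a\,y}{c} \qquad (a\,y < r).
\]
% Slow time scale: substitute x = x^{*}(y) into the predator equation
\[
\frac{dy}{dt}
  = y\left[\left(\frac{e\,a\,r}{c} - d\right) - \frac{e\,a^{2}}{c}\,y\right]
  = \rho\,y\left(1 - \frac{y}{K}\right),
\qquad
\rho = \frac{e\,a\,r}{c} - d,
\quad
K = \frac{e\,a\,r - c\,d}{e\,a^{2}}.
\]
```

The reduced equation is logistic with growth rate ρ and carrying capacity K, and it is meaningful only when ρ > 0, that is, when the predator can invade a resource sitting at its own equilibrium r/c.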

Formally, the methods of the last section should be replaced by singular perturbation
methods, which account explicitly for the relative strength of time scales.
Treatment of singular perturbation theory is beyond the scope of this paper, but the
reader is referred to the excellent discussion in Lin and Segel (1974), in which the
enzyme kinetic problem is treated explicitly.

Multiple-scale methods have been very influential in economics as well, especially
since the seminal paper of Simon and Ando (1961). Simon and Ando address
the problem of aggregation into sectors, introducing formal notions of system
decomposability. In particular, they focus on “nearly decomposable” systems.
Based on the network of interactions among agents or firms, a system may be 
decomposed into clusters within which there is strong interaction, and among 
which interactions are weak. When this is done, one treats within-cluster dynamics 
as operating on fast time scales, which reach quasi-steady-states that then interact 
on the slower time scale. Fisher and Ando (1971) extend this work further, dealing 
with “completely decomposable” systems, whose internal dynamics may be treated 
as autonomous on the fast time scale. 
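
The idea of near decomposability can be illustrated with a toy linear system. In the sketch below, four states form two clusters with strong internal mixing and a weak coupling ε between clusters; iterating the dynamics shows shares within a cluster equalizing quickly while the split between clusters drifts slowly. The matrix, the parameter values, and the function names are hypothetical constructions for illustration and are not taken from Simon and Ando.

```python
def transition_matrix(eps: float, stay: float = 0.7):
    """Row-stochastic 4x4 matrix with clusters {0, 1} and {2, 3} coupled with weight eps."""
    w = 1.0 - eps                    # probability of remaining inside one's own cluster
    same, other = stay * w, (1.0 - stay) * w
    leak = eps / 2.0                 # weak flow to each state of the other cluster
    return [
        [same, other, leak, leak],
        [other, same, leak, leak],
        [leak, leak, same, other],
        [leak, leak, other, same],
    ]


def step(x, P):
    """One update of the distribution x under the matrix P (x_new = x P)."""
    n = len(x)
    return [sum(x[i] * P[i][j] for i in range(n)) for j in range(n)]


def evolve(x, P, n_steps: int):
    for _ in range(n_steps):
        x = step(x, P)
    return x


P = transition_matrix(eps=0.01)
x_fast = evolve([1.0, 0.0, 0.0, 0.0], P, 20)     # short horizon
x_slow = evolve([1.0, 0.0, 0.0, 0.0], P, 2000)   # long horizon

print("after 20 steps:  ", [round(v, 4) for v in x_fast])
print("after 2000 steps:", [round(v, 4) for v in x_slow])
```

After a short time the two states of the first cluster hold nearly equal shares, yet the cluster as a whole still holds most of the total; only on the long horizon do the two clusters equalize.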

Of course, there are problems with this approach, not the least of which is the 
validity of the assumption of separation of scales, and the notion that clusters 
reach steady-states on the fast time scale. When the asymptotic dynamics are 
more complicated, for example chaotic, these simple results do not apply (but see 
Earn et al. 2000). Second, the approach is essentially linear; more generally, the 
network of connections will not exhibit constant strengths, and methods are poor 
for dealing with systems of changing connectivities. Yet the problem of aggregation 
and dimensional reduction remains a central one, and a rich area of research (Ijiri 
1971; Iwasa et al. 1987, 1989; Li and Rabitz 1991a, b). 

When scales are separable, as in the earlier examples, the analysis on the 
slower time scale still can introduce complexities. As the slow variables change, 
the fast equilibrium (if it exists) may change slowly for a while, but ultimately 
reach thresholds (bifurcation points) at which qualitative changes take place in 
the phase portrait. This may result in a discontinuous jump from one equilibrium 
point to another, a transition to periodic or chaotic behavior, or other transitions. 
Technically the subject of bifurcation theory, such phenomena also fueled interest 
thirty years ago in catastrophe theory (Thom 1983). Catastrophe theory attempts to 
catalogue the fundamental kinds of transitions that can take place. Because cata- 
strophe theory usually did not deal with systems with known dynamics, but tried 
to intuit underlying dynamics from observed macroscopic behavior, its application 
generated considerable resistance, and is today much less in evidence outside pure 
mathematics (Arnol’d 1986). Bifurcation theory, however, which begins from the
underlying dynamics and explores the consequences for macroscopic behavior, is 
a very powerful and popular tool. 

The simplest form of bifurcation in Thom’s generic scheme is the cusp cata- 
strophe, in which one equilibrium loses stability at a critical threshold in the slow 
variables, and the system jumps to a new equilibrium. Epidemic systems may 
fit this scheme neatly, as the level of susceptibility in the population reaches a 
threshold level at which outbreaks can be sustained. Ludwig et al. (1978) exploited 
this structure in their consideration of the dynamics of the economically important 
pest, the spruce budworm. In their model, when the budworm is at low densities, 
forest quality improves on a slow time scale. When forest quality has reached a 
sufficiently high level, the budworm population outbreaks, leaping from its low 
endemic level to a new epidemic one. Now, with the budworm at epidemic levels, 
forest quality gradually declines until it reaches a point at which the budworm 
can no longer be sustained; the budworm then crashes back to endemic levels
(Figure 7). This is an example of a relaxation oscillation. The threshold for explosion
is not the same as the threshold for collapse; between the two, the system can
exist in either of two quasi-steady-states, depending on history. Thus, it exhibits
hysteresis.

Figure 7. Spruce budworm and cusp bifurcation. A: equilibria of the spruce budworm-forest
interaction (circles). The central circle is unstable, while the two other equilibria are stable. R
is the reproductive rate of the budworm. B: cusp bifurcation. Depending on the slope and
on the position of the straight line in panel A, one or three equilibria can be found. The
curve separating these two regimes is cusp-shaped. Adapted from Ludwig et al. (1978), with
permission.
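
The coexistence of one or three equilibria in the budworm model can be checked numerically. The sketch below assumes a commonly used dimensionless form of the fast budworm equation, dx/dτ = r x (1 - x/k) - x²/(1 + x²), which is not written out in the text; its positive equilibria are the intersections of the straight line r(1 - x/k) with the curve x/(1 + x²). The parameter values and function names are illustrative assumptions.

```python
def per_capita_balance(x: float, r: float, k: float) -> float:
    """Straight line r(1 - x/k) minus the predation curve x / (1 + x^2)."""
    return r * (1.0 - x / k) - x / (1.0 + x * x)


def count_positive_equilibria(r: float, k: float, n_grid: int = 20000) -> int:
    """Count sign changes of the balance on a uniform grid over (0, k].

    A grid point that lands exactly on a root is not handled specially,
    so this is a rough diagnostic rather than a root finder.
    """
    count = 0
    previous = per_capita_balance(k / n_grid, r, k)
    for i in range(2, n_grid + 1):
        current = per_capita_balance(i * k / n_grid, r, k)
        if previous * current < 0:
            count += 1
        previous = current
    return count


k = 10.0                              # hypothetical dimensionless carrying capacity
for r in (0.2, 0.45, 0.8):            # hypothetical dimensionless growth rates
    print(f"r = {r}: {count_positive_equilibria(r, k)} positive equilibria")
```

With three equilibria present, the outer two are the endemic and epidemic states and the middle one is the unstable threshold between them, which is the configuration that allows hysteresis as the slow variable changes.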

The cusp catastrophe has also enjoyed recent influence in the field of environ- 
mental economics, through attention to the dynamics of lake systems, especially 
shallow lakes, subject to eutrophication (e.g., Carpenter et al. 1999; Arrow et al.
2000). In the shallow lake example, the level of phosphorus is a balance between
processes that inject the element into the lake (loading), and those that clear it
(purification). For intermediate levels of purification, as phosphorus loading is
increased, the system may jump from a low (oligotrophic) phosphorus level to a
high (eutrophic) one, which may be refractory to amelioration. 

The luxury of easily separable scales does not apply to all systems; obviously, 
when we cannot so separate scales, the dynamics become even more compli- 
cated. For example, many systems have intrinsic periodic dynamics, with their own 
natural scale of fluctuation. When such systems, for example host-parasite systems, 
are forced, say, by annual periodicities, things are relatively simple if the scales are
very different: one fluctuation will be superimposed on top of another. When the
scales are not too different, however, the periodicities interfere with one another, 
and may give rise to chaos. Chaos, of course, can also arise in systems that are 
characterized by multiple endogenous periodicities, as when turbulence arises in 
fluid dynamics, or chaos in population dynamics (May 1974). 

So far in this section, most of the focus has been on time scales. The discussion 
of Simon and Ando’s work, however, makes clear that temporal and organizational 
scales interact, as do spatial and temporal ones (Figure 8). It is well beyond the 
scope of this paper to discuss the problem of pattern formation in spatial systems, 
and the interplay between space and time (but see Levin 1979, 1992). In general,
however, pattern formation emerges from the interplay between processes that
break symmetry, and those that stabilize non-uniform structures. Alan Turing, in
a classic paper (Turing 1952) motivated by problems in developmental biology,
showed how local interactions could give rise to global patterns, through the
interplay of activation and inhibition. Turing hypothesized the existence of two
chemicals, termed morphogens, one an activator and the other an inhibitor. In his
scheme, both chemicals diffuse, but the inhibitor does so on a much broader scale.
It is possible then to show that local (short-range) activation breaks symmetry, and
long-range inhibition stabilizes it (Segel and Levin 1976; Levin and Segel 1985).
Pattern arises, then, through the interplay between processes operating on very
different scales. In general, the problem of pattern formation, and the interplay
between activation and inhibition, is of deep importance for understanding the
organization of societies and economies.

Figure 8. Zooplankton biomass variability across scales (Stommel diagram). Processes are
mostly observed along a straight line of increasing spatial and temporal scales. After Haury et
al. (1978), reprinted in Levin (1992), with permission.
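
Turing's mechanism can be illustrated with a linear stability calculation. The sketch below takes a hypothetical activator-inhibitor Jacobian that is stable in the absence of diffusion and computes the growth rate of a spatial perturbation of wavenumber k when the two chemicals diffuse at rates d_u and d_v. The Jacobian entries, the diffusion coefficients, and the function names are assumptions chosen for illustration, not values from Turing (1952).

```python
import math


def growth_rate(k: float, jac, d_u: float, d_v: float) -> float:
    """Largest real part of the eigenvalues of J - k^2 * diag(d_u, d_v)."""
    (a11, a12), (a21, a22) = jac
    m11 = a11 - d_u * k * k
    m22 = a22 - d_v * k * k
    trace = m11 + m22
    det = m11 * m22 - a12 * a21
    disc = trace * trace - 4.0 * det
    if disc < 0:                      # complex pair: real part is trace / 2
        return trace / 2.0
    return (trace + math.sqrt(disc)) / 2.0


def most_unstable_mode(jac, d_u: float, d_v: float, k_max: float = 3.0, n: int = 600):
    """Scan wavenumbers on a grid; return (k, growth rate) of the fastest-growing mode."""
    best_rate, best_k = max(
        (growth_rate(i * k_max / n, jac, d_u, d_v), i * k_max / n) for i in range(n + 1)
    )
    return best_k, best_rate


# Hypothetical kinetics: u activates itself and v; v inhibits u and decays.
JAC = ((1.0, -1.0), (2.0, -1.5))

k_star, rate_unequal = most_unstable_mode(JAC, d_u=1.0, d_v=20.0)  # inhibitor spreads widely
_, rate_equal = most_unstable_mode(JAC, d_u=1.0, d_v=1.0)          # equal ranges

print(f"long-range inhibition: fastest mode k = {k_star:.2f}, rate = {rate_unequal:.3f}")
print(f"equal diffusion:       largest rate = {rate_equal:.3f}")
```

When the inhibitor diffuses much faster than the activator, a band of finite wavelengths acquires a positive growth rate even though the uniform state is stable to uniform perturbations; with equal diffusion no mode grows.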



8. Synthesis: Scale and Complex Adaptive Systems 

The problem of scale is by no means a purely conceptual one. Careful analyses 
of the scaling properties of biological systems have had far-reaching consequences 
in the theory of evolution. Similar reflections have led to major achievements in 
physics, for example in the study of turbulence, and in unveiling the universal 
properties of systems near critical transitions. New techniques have emerged for 
detecting scales (wavelet transforms), and others have exploited the consequences 
of scale invariance (renormalization group methods).

It is obviously easier to derive robust experimental results with small systems
than with large ones. However, problems such as the stability of the climate of
the Earth cannot be addressed at the regional scale, and even the current globally
coupled ocean-atmosphere-biosphere models can only describe the climate of the
Earth at a resolution of half a degree (~55 km). They capture many of the large-
scale feedbacks, such as the El Niño oscillation, otherwise inaccessible. However,
the incorporation of subgrid-scale processes - what is going on inside the 0.5 × 0.5
degree cells - is of fundamental importance.

However, we have here made a more ambitious proposal. We have suggested
that scaling concepts offer an avenue to study heterogeneous assemblies for which 
the microscopic processes are not known, and probably not knowable (e.g., in the 
case of social systems), except in terms of their statistical properties. These systems 
are complex adaptive systems (Levin 2000), which must be studied across levels 
of organization too.

Appendix



Now, let us consider the macroscopic scale, which is experimentally accessible. Practically,
one is interested in the density of particles in a domain of size dX around a location X, and
within a range of velocity dV around V, n(X, V) dX dV. We use capital letters to avoid
possible confusion between the microscopic variables ($x_i$, $v_i$) and the macroscopic ones
(X, V). The rate equation for n(X, V) is:

\[
n(X', V', t + \delta t) = \int_{\Omega} P(X', V' \mid X, V; \delta X_e, \delta V_e)\, n(X, V, t)\, dX\, dV \tag{2}
\]

where $P(X', V' \mid X, V; \delta X_e, \delta V_e)$ is the transition rate from the elementary volume around
(X, V) in the phase space to the elementary volume around (X', V'). This equation
describes the dynamics of the density n(X, V). At this point, we are dealing with a macro-
scopic equation; (X, V) is not the position of a particle, but that of a small volume of
“fluid” made out of a large number of particles. Well-known methods exist to transform
equation (3) into a tractable dynamic equation. The simplest form is Boltzmann’s equation.
Assuming that the collision term ($\delta X_e$, $\delta V_e$) is negligible in equation (1), the microscopic
description coincides with the macroscopic one, and we rewrite (1) as:

\[
n(X + V\,\delta t,\; V + A\,\delta t,\; t + \delta t) = n(X, V, t)
\]

where A is the acceleration (force per unit mass) of the element of system. This is equiva-
lently rewritten $P(X', V' \mid X, V; \delta X_e, \delta V_e) = P(X + V\,\delta t,\; V + A\,\delta t \mid X, V)$. Taking the limit
$\delta t \to 0$ yields the following differential equation



\[
\frac{\partial n(X, V, t)}{\partial t} + \sum_{i=1}^{3} V_i \frac{\partial n(X, V, t)}{\partial X_i} + \sum_{i=1}^{3} A_i \frac{\partial n(X, V, t)}{\partial V_i} = 0 \tag{3}
\]

where $X = \{X_1, X_2, X_3\}$. Now, if collisions are present, we can generalize equation (3)
by:

This is Boltzmann’s equation. Of course, the interesting part of this equation is the right-
hand term, and one must deduce it from the microscopic laws (1) and from the conservation
of momentum ($m_1 v_1 + m_2 v_2 = \text{const}$) and of kinetic energy ($m_1 v_1^2 + m_2 v_2^2 = \text{const}$) during
a collision. This can be carried out using macroscopic arguments, as in the Chapman-
Enskog derivation (Chandrasekhar 1943). A partial differential equation, the Fokker-Planck
equation, can be used as an alternative kinetic equation in certain limiting cases:



dn{X, y, t) 
9f 




n(X, V, t) 



1 

2 



E 



dVadVfi 






n(X, V, t) 



(4) 



Equation (4) is valid in the absence of random hops ($\delta X_e = 0$). In this case, the matrix $E_{\alpha\beta}$
is the covariance matrix $\langle \delta V_{e\alpha}\, \delta V_{e\beta} \rangle$. For a discussion of this approach for natural systems,
see Flierl et al. (1999) and Levin (2001). The above procedure can also be directly derived
from the microscopic theory by constructing the Liouville equation, a microscopic equiva-
lent of equation (3), in which collisions are added. This procedure is due to Bogoliubov
(1946), and often referred to as the BBGKY procedure (after its authors, Bogoliubov, Born,
Green, Kirkwood, and Yvon; see Ferziger and Kaper 1972).

Abstract. The relationship between grass production and the quantity of trees in mixed tree-grass
ecosystems (savannas) is convex for all or most of its range. In other words, the grass production 
declines more steeply per unit increase in tree quantity at low tree cover than at high tree cover. 
Since much of the economic value in savannas is ultimately derived from grass, and the main mecha- 
nism controlling the tree-grass balance is dependent on the production of grassy fuel for fires, this 
non-linearity has the effect of creating two savanna configurations. One has a low tree density and 
supports a viable grazing enterprise, while the other has dense tree cover and a frequently non-viable 
grazing enterprise. The non-linearity is suggested here to have two main sources: the geometry of 
the spatial interaction between tree root system and grasses, and the effect of differing phenology 
(the time course of leaf area exposure) on the acquisition of water and nutrients. The existence of the 
non-linearity reduces the resilience of the generally-preferred “open” configuration, and increases 
the resilience of the less-desirable “closed” configuration. 

Key words: competition, primary productivity, resilience, savannas 



1. Introduction 

A large fraction of the Earth’s land surface is covered by ecosystems in which 
the plant production is generated from mixtures of trees and grasses. The most 
extensive case is the approximately one-eighth of the surface occupied by tropical 
savannas (Scholes and Hall 1996). Similar ecosystems occur outside the tropics as 
well, and in certain cultivated systems. 

The key issue in the management of these systems for human benefit relates 
to the proportions of the mixture, since more tree means less grass. Where the 
principal benefits are based on the grass component, such as in grazing systems, the 
strategy seems obvious: maximise grass production. In the short term, maximum 
grass production is typically (but not always: see Figure 1) achieved at zero tree
presence, but since removal of the trees is a significant and recurring expense, and
may incur other environmental costs, the objective typically involves finding the
level of tree presence that strikes a balance between loss of grass production and
loss of other ecosystem services.

The quantity of woody plants in the system can be measured in a variety of
ways: tree canopy cover percentage, tree biomass, leaf area, stand basal area or

Figure 1. Empirical relationships between the quantity of trees in the system (measured in
various ways, and here normalised so that the largest tree quantity reported in the study is 
equal to 1) and the aboveground grass production. 



number of trees per unit area. The last-mentioned is not a good measure, since 
most systems have many small trees that have little functional impact on grasses. 
All the other “treeness” measures are highly correlated with one another and with 
impacts on grass production. Unless otherwise stated, this paper uses stand basal 
area (i.e., the sum of the cross-sectional area of the stems near the ground level, 
per unit ground area, m²/ha) as the index of tree quantity, and it is here given the
symbol X. Note that this is a measure of “stock”, whereas in general the grasses
are measured in terms of production (g/m²/y), which is a “flux”.

Grass tillers are short-lived (< 1 year), so the useful and grazable stock 
is more-or-less equal to the within-growing-season production. The use of this 
mixed terminology has the potential for causing confusion, particularly in graphical
analysis where the state of the system is represented by the stocks of the
two components. For the key analysis in this paper, the grass production flux is 
converted to a stock by accumulating it, less losses due to fire, herbivory and decay. 

The trade-off curve between the stand basal area and the grass production is 
central to both the economic use of savannas and their ecological dynamics. In 
every recorded case it has a strongly non-linear form. The grass production declines 
as tree basal area increases. In almost all cases this relationship is convex over its 
entire range; i.e., the grass production declines more rapidly for the initial incre- 
ments in tree basal area than it does for subsequent increments (Scholes and Archer 
1997). In a small proportion of cases, there is little or no impact on grass production 
(or even a slight increase) for the first increments of tree basal area above zero, 
followed by a transition to an inverse convex relationship above a certain level of
basal area per hectare (Figure 1).

Savanna systems around the world have been observed to be unstable in terms
of the proportions of the mixture, under certain forms of management. In particular,
following the introduction of sustained heavy grazing by domestic stock, usually 
accompanied by reduction in the fire frequency and intensity, the quantity of trees 
tends to abruptly and apparently irreversibly increase, often to a point where the 
grass production is so suppressed that pastoralism becomes commercially unviable. 

This paper addresses two questions: 

• what is the mechanistic basis of the observed convexity in the grass production 
vs tree basal area curve? 

• does the shape of this curve have anything to do with the abrupt transition 
between different configurations of tree-grass systems? 

The analysis presented here is one way of conceptualising savannas. Like all 
ecosystems, savannas can be viewed through a variety of lenses, whose validity 
depends on the objectives of the study. The relatively simple and abstracted 
model presented here aims to explain the widely-observed non-linearity in grass 
production as a function of tree cover, and to explore its consequences for system 
dynamics. It does not address species composition dynamics within the grass or 
tree layers, nor the complexities of grazing behaviour. 

2. The Convex Relationship between Tree Basal Area and Grass Production

In the semi-arid to dry subhumid climates that dominate the tropical savanna 
extent, aboveground grass production, in the absence of tree cover and accumulated
over the period of the wet season (i.e., g DM/m²/y, where DM stands for
dry matter), has repeatedly been shown to be a linear function of rainfall received 
during the growing period (equation 1; Rutherford 1980; Scholes 1993). This has 
been proposed as evidence that grass production in these systems is water-limited; 
I suggest that it shows that grass production is both water and nutrient limited 
(Scholes 1997). For instance, fertilisation of these grasslands with nitrogen leads 
to a relation that is still linear, but with a steeper slope. The simple interpretation is 
that water availability controls the duration of grass growth in these episodically- 
wetted soils, while nutrient availability determines the rate of growth when water is 
available. The slope of such a linear relationship between Rainfall (R) and above- 
ground grass production (Pg) is a reflection of the nutrient availability in the system. 
It is sometimes referred to as the “rain use efficiency” (le Houerou 1984). The 
intercept of the line, on the other hand, reflects the availability of soil water. For 
instance, more water is needed to commence production on a clayey soil than on a 
sandy soil because of the greater initial losses to evaporation and runoff on clayey 
soils. This is clearer when the equation is written in the form of a slope (a) and an 
x-axis intercept (c), rather than the more usual y-axis intercept: 

Pg = a(R - c) (1) 

A compilation of data from a variety of savanna-climate grasslands provides the 
following empirical predictors of the slope and x-axis intercept, as functions of the
sand content of the soil (s, as a percentage). For soils between 64 and 92% sand
content

a = -0.0376s + 3.442 (2)

For soils above 92% sand content, a can be given the constant value of 0.1, and
below 64%, the constant value of 1.1 g/m²/y/mm. The value of c is a function of
the value of a, since they are both ultimately related to soil texture:

\[
c = 328 - \frac{142}{a} \tag{3}
\]
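
The rainfall relationship of equations (1) and (2) can be sketched in a few lines. This is an illustration under stated assumptions rather than the author's implementation: the function names are hypothetical, the linear fit for the slope is clamped to the two constant values quoted in the text (0.1 and 1.1), production is floored at zero when rainfall is below the x-axis intercept, and the intercept c is passed in explicitly rather than computed.

```python
def rain_use_efficiency(sand_percent):
    """Slope a of equation (2), in g/m2/y per mm of rainfall.

    Assumption: the linear fit is clamped to the constant values given
    in the text for very sandy (0.1) and less sandy (1.1) soils.
    """
    if sand_percent > 92.0:
        return 0.1
    if sand_percent < 64.0:
        return 1.1
    linear = -0.0376 * sand_percent + 3.442
    return min(1.1, max(0.1, linear))

def grass_production(rainfall_mm, slope, x_intercept_mm):
    """Equation (1), Pg = a(R - c), floored at zero (an added assumption)."""
    return max(0.0, slope * (rainfall_mm - x_intercept_mm))

if __name__ == "__main__":
    a = rain_use_efficiency(80.0)   # a soil with 80% sand
    c = 150.0                       # hypothetical x-axis intercept, mm
    for rain in (100.0, 300.0, 600.0):
        print(rain, round(grass_production(rain, a, c), 1))
```

Writing the line with an x-axis intercept makes the threshold behaviour explicit: no production is predicted until rainfall exceeds c, after which each additional millimetre yields a grams of dry matter per square metre.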

Measurement of tree radial increment growth (the growth revealed by tree rings, or 
by measuring the increasing circumference of the stem) shows that wood produc- 
tion is related to both rainfall and “site quality”, a factor that includes soil depth, 
landscape position and fertility. Primary production by trees is thus also related to 
water and possibly to nutrient availability, but in less direct ways than in grasses. 
This is because trees have a greater storage capacity for water, nutrients and carbo- 
hydrates than grasses do, and have a greater rooting volume and depth. Thus in 
many circumstances they are able to carry over a significant quantity of water, 
nutrients and carbohydrates between years, as well as conserving and recycling 
a substantial fraction of their nutrients internally at the time of leaf drop. Thus 
production by trees in a given year needs to consider rainfall not only in the current 
growing season, but in the previous one as well. 

When trees and grasses grow together, several sometimes opposing interactive 
mechanisms are operative (Scholes and Archer 1997): 

1. Competition for water and nutrients between the root systems of the trees and 
grass; 

2. Reduction of the photosynthetically-active radiation and rainfall reaching the 
grass canopy, through prior interception by a tree canopy; 

3. Improved growing conditions for the grasses immediately below the canopy of 
a tree due to local nutrient enrichment and improved soil water conservation; 

4. A risk of scorching for the trees when the flammable dry grass burns.

Although trees and grasses may be occupying the same land area, their root systems 
are not necessarily occupying entirely the same volume. The grass cover is usually 
characterised as “continuous” whereas the tree cover is “discontinuous”. Thus, in 
purely geometrical terms, there may be areas where the grasses are relatively free 
of tree influence, simply because they are in the interstices of the tree rooting radii 
(Figure 2). 

Since the root reach of savanna trees has been shown to be several multiples of 
the crown radius (Rutherford 1983; Groot and Soumare 1995), a patch of savanna 
can be considered to comprise three more-or-less spatially distinct sub-habitats: 

1. the area immediately below the tree crowns, where the grasses are subject to 
shading and the interception of water by the crowns, and although exposed to 
root competition from the trees, generally benefit from the enhanced nutrient 
status under the crown and a lower evaporative demand; 





Figure 2. The fraction of the vegetation patch that is beneath tree canopies and between 
tree canopies can be calculated from knowledge of the tree canopy diameter distribution and 
the spatial dispersion pattern of trees in the landscape. For simple distributions of both an 
analytical solution is possible, but for complex distributions, an empirical fit to a numerical 
solution is more practical. The curves described here assume random spatial distribution, a 
crown diameter 10 times greater than the stem diameter, and a root diameter 2.5 times the 
crown diameter. 



2. the area within the reach of tree roots, but beyond the crown, where shading is 
minor, but there is competition with trees for water and nutrients; and 

3. the area beyond both the crown and the reach of tree roots. Grass production 
here should be the same as grass production in the complete absence of trees. 
This area is small in savannas with a high tree cover, but can be significant in 
sparsely-treed savannas. 

The aboveground production by grasses (Pg) in a mixed tree-grass situation can 
therefore be expressed as 

\[
P_g = P_{g1} A_1 + P_{g2} A_2 + P_{g3} A_3 \tag{4}
\]

where A1, A2 and A3 are the fractions of the patch occupied by the subcanopy, tree
root zone and tree root-free subhabitats respectively, and Pg1, Pg2 and Pg3 are the
grass productivities per unit area in each zone.

The area of each subhabitat can be calculated geometrically if the distributions 
of tree sizes are known as well as their spatial distribution. The tree size distribution 
in undisturbed, self-regenerating populations follows an “inverse J” distribution, 
i.e., an exponential decline in the numbers of individuals in each successive 
size class due to the effect of tree mortality. It may deviate substantially from 
this ideal form - for instance, many populations show episodic recruitment and 
mortality, giving multi-modal size distributions. The default assumption regarding 
spatial distribution is that the location of each tree is random (i.e., independent of 
every other tree). This assumption is approximated in some naturally-regenerating 
populations, but not all. Clumping (at several scales, and operating differently for 
different species in a mixture) is commonly observed, and so is “over-dispersion”
(i.e., a tendency towards regularity, with its extreme form in the hexagonal spacing
of an orchard). Since both the size and location distributions can be quite complex,
this paper calculated a family of empirical predictors of A1 and A2, based on the
output of a numerical simulation fed with a range of feasible spatial and size-class 
assumptions. Thus, for a randomly distributed population of trees with an inverse 
J size distribution, and where the canopy radius is 10 times the radius of the stem 
(a typical value: Scholes et al. 2002), 

Ai = 1 - (5^ 

Similarly, for trees where the effective root radius is 25 times the stem radius, the 
area of the subhabitat outside of the canopy but still within the tree root range is 

A 2 = (1 - - Al (6) 

Note that these equations suggest the root reach is about 2.5 times the canopy 
reach. Researchers who have excavated savanna tree root systems report higher 
numbers, around 6 times canopy radius (Rutherford 1983; Chin Ong 1996). When 
these higher values are used in the simulation, effectively no part of the savanna is 
free of competition with tree roots, which does not accord with field observations. 
The solution to this apparent contradiction is probably that the roots far from the 
tree cannot effectively colonise the entire soil volume of the circle described by 
their most distal reach. The density of roots declines approximately exponentially
with distance from the stem (van Noordwijk 1996). Thus an “effective radius” can
be described that is smaller than the maximum radius; about 2.5 times greater than
crown radius seems to provide satisfactory results. Finally,



\[
A_3 = 1 - (A_1 + A_2) \tag{7}
\]

In the context of the focus of this paper on non-linearities, the important point is 
that regardless of the assumptions made about canopy and root dimensions and 
plant distributions, the fraction of area that grasses have for their exclusive use, 
and the unshaded area, both decline in a convex form with increasing stand basal 
area of trees. The degree of convexity increases with dispersed spatial distributions 
and decreases with clumped distributions. 
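
The convex decline of the tree-free area can be illustrated with a simple geometric sketch. The assumptions here are mine and are not the paper's fitted predictors: all trees are the same size, tree positions are random and independent (so the uncovered fraction of ground follows the standard Poisson overlap formula), the canopy radius is 10 stem radii and the effective root radius is 25 stem radii. Under these assumptions the summed canopy area per unit ground is 100 times the stand basal area fraction, and the summed root-zone area is 625 times that fraction.

```python
import math

CANOPY_TO_STEM = 10.0   # canopy radius / stem radius (value quoted in the text)
ROOT_TO_STEM = 25.0     # effective root radius / stem radius (value quoted in the text)
M2_PER_HA = 10_000.0

def subhabitat_fractions(basal_area_m2_per_ha):
    """Return (A1, A2, A3) for equal-sized, randomly placed trees.

    Assumption: random placement, so the fraction of ground not covered
    by any disc is exp(-summed disc area per unit ground area).
    """
    canopy_load = CANOPY_TO_STEM ** 2 * basal_area_m2_per_ha / M2_PER_HA
    root_load = ROOT_TO_STEM ** 2 * basal_area_m2_per_ha / M2_PER_HA
    a1 = 1.0 - math.exp(-canopy_load)          # beneath tree canopies
    a2 = (1.0 - math.exp(-root_load)) - a1     # within root reach, outside canopy
    a3 = 1.0 - (a1 + a2)                       # free of tree influence
    return a1, a2, a3

if __name__ == "__main__":
    for x in (0.0, 5.0, 10.0, 20.0):
        a1, a2, a3 = subhabitat_fractions(x)
        print(f"X={x:5.1f}  A1={a1:.3f}  A2={a2:.3f}  A3={a3:.3f}")
```

In this sketch the tree-free fraction falls fastest for the first increments of basal area and flattens thereafter, which is the convex form the text describes.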

The grass productivity in subhabitat 3 (where there is no tree root or crown 
influence) is given by the equation for grass production in the absence of trees 
(equation 1) 



\[
P_{g3} = a(R - c) \tag{8}
\]

The grass productivity in subhabitat 2 (root competition but no crown influence) 
depends on the proportion of the available resources in the soil (water and nutri- 
ents) that is captured by trees, and is thus unavailable for grasses. The classical 
hypothesis in this regard is that grasses have preferential access to resources in the 
topsoil, while trees have preferential access in the subsoil (Walter 1971). Empirical
evidence (e.g., Scholes and Walker 1993) indicates that the effective rooting-depth
niche separation is minor by comparison to the niche separation in time, largely 
because water and nutrients are both mostly to be found in the topsoil, and thus 
trees and grass both have most of their roots there. The niche separation in time 
results from the fact that trees expand their leaf area more rapidly at the beginning 
of the growing season, and retain them for longer at the end of the growing season, 
than grasses do. Even though the tree relative growth rate may be slightly less than 
that of grasses, and in some cases the grass peak leaf area may exceed the tree 
leaf area in the middle of the growing season, the beginning-and-end of season 
windows of near-exclusive use provide a strong advantage to trees. 

Assuming that the resource being competed for is water, the fractions of the 
shared water resource acquired by trees and grasses in a mixture can be predicted 
using the “electrical analog” model for water flow. Water uptake is a good proxy 
for nitrogen capture as well, since nitrogen is predominantly taken up in the 
soluble inorganic forms, which require water to be transported to the root. The key
“resistance” to water transport through the soil-plant-atmosphere system is the leaf
resistance (rleaf), which is inversely proportional to the leaf area, L (rleaf = rstomata/L,
where rstomata is the resistance to flow through the pores in a unit area of leaf surface,
and has a median value of about 2.5 s/cm in both trees and grasses in savannas).
Because the grass leaves are “in parallel” with the tree leaves as a pathway of flow 
between the soil and atmosphere, the instantaneous fraction obtained by the grass 
is given by 




( 9 ) 



if we assume they are both exposed to the same soil water supply (ψsoil) and
atmospheric demand (ψair).
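
The electrical analogy can be made concrete with a short sketch. It is an illustration of the general parallel-resistance argument, not a restatement of equation (9): it assumes that each pathway's conductance is its leaf area divided by the stomatal resistance, that both pathways see the same driving gradient, and that the same median stomatal resistance applies to trees and grasses. The function names are hypothetical.

```python
R_STOMATA = 2.5  # s/cm, median stomatal resistance quoted in the text

def leaf_conductance(leaf_area, r_stomata=R_STOMATA):
    """Conductance of a leaf pathway: the inverse of r_leaf = r_stomata / L."""
    return leaf_area / r_stomata

def grass_water_fraction(grass_leaf_area, tree_leaf_area):
    """Instantaneous share of water flow through the grass pathway.

    For resistors in parallel under a common potential difference, the
    flow divides in proportion to the conductances.
    """
    g_grass = leaf_conductance(grass_leaf_area)
    g_tree = leaf_conductance(tree_leaf_area)
    return g_grass / (g_grass + g_tree)

if __name__ == "__main__":
    # Early season: grass leaf area is small, tree canopy is fully expanded.
    print(grass_water_fraction(0.1, 3.0))
    # Mid season: grass leaf area has caught up with part of the tree canopy.
    print(grass_water_fraction(1.0, 3.0))
```

When both plant types have the same stomatal resistance, the share reduces to the ratio of grass leaf area to total leaf area, which is why the timing of leaf expansion matters so much for the seasonal outcome.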

The fraction of the water resource obtained over the entire growing season is
the integral of Fw,g. There is currently no analytical way of predicting the temporal
pattern of tree and grass leaf development (“phenology”), since it depends on the
particulars of rainfall and temperature in a given year, in interaction with the soil
and any residual water in it. Some generalisations allow a numerical approximation
to be made to the time-integral of Fw,g. Despite a range in mean annual rainfall in
savannas between about 300 and 1200 mm/y, in most cases the rainfall is concen- 
trated into a growing season of about 6 months duration. The trees leaf out as soon 
as the rains begin (or even before, if there is residual water in the soil, provided the 
air temperature is warm enough) and reach full leaf area expansion within about 
two weeks. Within the canopy area, the maximum tree leaf area index (one-sided
leaf area per unit ground area) is fairly consistent in savannas at 3 m²/m² (Scholes
et al. 2002). The grasses, on the other hand, need to grow their leaf area by using
current photosynthesis. Once the rains begin, and commencing with a leaf area of
about 0.001 m²/m² provided by root reserves, the leaf area grows by about 2%
per day. Grass leaf senescence sets in once the accumulated days of water-stress
exceed about a week. The water deficit is in turn partly determined by the combined
tree and grass leaf area. Thus, as the season progresses, the grass leaf area grows
exponentially, then peaks and declines. Tree leaf loss only commences after about
14 days of water stress. For the specific climate data of a typical savanna site at
Skukuza, South Africa, an empirical representation of the relationship between
stand basal area of trees (X) and the fraction of the water resource captured by
trees when growing in the same soil volume as grasses is

\[
F_{w,t} = \frac{X}{2 + X} \tag{10}
\]

The grass productivity in subhabitat 2 can then be estimated as

\[
P_{g2} = a\,[(R - c)(1 - F_{w,t})] \tag{11}
\]

Again, the key point is that the relationship has a convex shape in relation to tree
basal area for all reasonable assumptions regarding tree and grass phenology.
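
A short numerical sketch of equations (10) and (11) shows the convexity directly. The function names and the example values of a, R and c are hypothetical and serve only to illustrate the shape of the curve.

```python
def tree_water_fraction(basal_area):
    """Equation (10): share of the shared water resource captured by trees."""
    return basal_area / (2.0 + basal_area)

def grass_production_root_zone(rainfall_mm, slope, x_intercept_mm, basal_area):
    """Equation (11): grass productivity where tree roots compete for water."""
    return slope * (rainfall_mm - x_intercept_mm) * (1.0 - tree_water_fraction(basal_area))

if __name__ == "__main__":
    a, rain, c = 0.5, 600.0, 100.0   # hypothetical slope, rainfall (mm), intercept (mm)
    previous = None
    for x in (0.0, 2.0, 4.0, 6.0, 8.0):
        pg2 = grass_production_root_zone(rain, a, c, x)
        drop = "" if previous is None else f"  (decline {previous - pg2:.1f})"
        print(f"X={x:4.1f} m2/ha  Pg2={pg2:6.1f}{drop}")
        previous = pg2
```

Each successive increment of basal area removes less grass production than the one before it: the first 2 m²/ha halves production in the root zone, while the next 2 m²/ha removes only a further sixth of the tree-free value.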

In subhabitat 1, the area directly under the tree canopy, there is, in addition
to the competition for water, a reduction in water supply and solar radiation due
to interception by the tree canopy. About 5% of the rain falling on the canopy
is intercepted (de Villiers 1981). The fraction of the incoming solar radiation that
penetrates through the tree canopy to reach the grass layer (Frg), assuming a random
orientation of the leaves in the canopy, is

\[
F_{rg} = e^{-0.5 L_t} \tag{12}
\]

where Lt is the tree leaf area index.



This has two effects. Firstly, it reduces the rate of photosynthesis, which is a
function of intercepted radiation. Secondly, it reduces the rate of evaporation from
the soil and vegetation under the canopy, thus prolonging the period for which
photosynthesis occurs (Scholes 1988; Belsky et al. 1993). The degree to which
the latter effect is expressed depends on what fraction of the energy budget for
evaporation in the subcanopy area is provided by radiation, and what fraction by
the movement of hot, dry air across the shaded area from the adjacent intercanopy
areas. This will vary in a complex way with the relative water availability in the
two patches. As a first approximation, I assume that the fraction provided by direct
radiation is equal to A1. Thus the “effective wetness” modifier of net rainfall (E) is
proposed to be

\[
E = \frac{1}{1 - A_1 (1 - F_{rg})} \tag{13}
\]

The relationship between radiation (I) and photosynthesis (P) can be approximated
by a rectangular hyperbola

\[
P = \frac{\alpha I P_{max}}{\alpha I + P_{max}} \tag{14}
\]

Under the very bright conditions that apply in savanna regions, a large reduction in
radiation translates into a smaller reduction in net primary production. The tree leaf 
area index in the area under the tree canopy is about 3, which reduces radiation by 
70%, but production only by half. The shading reduction factor is given the symbol 
Fl. 
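
The saturating light response can be sketched numerically. This is an illustration under stated assumptions: radiation attenuates through the canopy as exp(-0.5 Lt), photosynthesis follows the standard rectangular hyperbola P = αI·Pmax/(αI + Pmax), and the ratio of αI to Pmax in full sun (`full_sun_ratio`, set here to 2.5) is a hypothetical value chosen to represent bright conditions. The function names are likewise hypothetical.

```python
import math

def radiation_fraction_below_canopy(tree_lai, extinction=0.5):
    """Fraction of incoming radiation reaching the grass layer."""
    return math.exp(-extinction * tree_lai)

def photosynthesis(radiation, alpha, p_max):
    """Rectangular hyperbola: linear at low light, saturating at p_max."""
    return alpha * radiation * p_max / (alpha * radiation + p_max)

def shading_reduction_factor(tree_lai, full_sun_ratio=2.5):
    """Production under the canopy relative to production in full sun.

    Radiation is expressed in units of p_max / alpha, so alpha = p_max = 1.
    full_sun_ratio is a hypothetical degree of light saturation.
    """
    transmitted = radiation_fraction_below_canopy(tree_lai)
    shaded = photosynthesis(transmitted * full_sun_ratio, 1.0, 1.0)
    open_sun = photosynthesis(full_sun_ratio, 1.0, 1.0)
    return shaded / open_sun

if __name__ == "__main__":
    lai = 3.0
    print("radiation transmitted:", round(radiation_fraction_below_canopy(lai), 3))
    print("relative production  :", round(shading_reduction_factor(lai), 3))
```

With a leaf area index of 3 the canopy passes a little over a fifth of the radiation, yet relative production in the sketch falls only to about half, because photosynthesis in full sun is already close to saturation.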

The nutrient supply under the tree canopy is enhanced relative to the area 
between canopies. This is partly due to the fact that nutrients are taken up over 
the entire tree root extent, but preferentially deposited, by leaf fall and leaching, 
in the area below the canopy. From the perspective of grass production over the 
entire savanna plot, this is a “zero sum game” - the gains in the subcanopy habitat 
are losses elsewhere. The same applies to the additional nutrient input under tree 
canopies that derives from dust interception and the excreta of birds and animals 
congregating near the tree. However, a third component, due to nitrogen fixation by 
microbial symbionts exclusively associated with trees, is additional. If the fraction 
of the soil nitrogen in the soil under the tree that is derived from symbiotic nitrogen 
fixation is Fn (this value can be determined by inference from the proportions of 
isotopes of nitrogen present, and is seldom more than about 0.2), the productivity 
of grass beneath the tree canopy can be estimated as 

\[
P_{g1} = F_l (1 + F_n)\, a\, W\, (0.95R - c)(1 - F_{w,t}) \tag{15}
\]

The Pg1 component has the potential to account for the observation that sometimes
a small quantity of trees in the system leads to increased overall grass production.
For high values of Fn and small values of stand basal area, the beneficial effects of
nitrogen fixation can outweigh the detrimental effects of competition.

Note that the complementary relationships (the effect of stand basal area of trees
on tree production) are generally concave. In other words, tree productivity rises
steeply initially, and reaches an asymptote at high levels of X. A simple equation,
consistent with the above logic, is that

\[
P_t = (A_1 + A_2)\, F_{w,t}\, R\, a_t \tag{16}
\]

where at is the “rain use efficiency” of trees, about 0.35 g/m²/mm.



3. Some Consequences of the Convexity 

3.1. IS PRIMARY PRODUCTION IN TREE-GRASS SYSTEMS A ZERO-SUM 
GAME? 

Historically, the managers of mixed tree-grass systems have tended to focus solely 
on only one of the components. If they were graziers, their objective was to 
maximise grass production, and if they were foresters, it was to maximise tree 
production. The typical solution, in the absence of a model that deals with the 
interaction of trees and grasses, is to eliminate the unwanted component. If the 
overall productivity of the savanna mixture was a linear function of the amount
of tree in the system, then this still remains an “optimal solution”: the system
should be driven to whichever endpoint has the highest value. If the outcome of the
interaction between trees and grasses is such that the highest productivity occurs at 
an intermediate level of tree cover, then it is possible that for some combinations 
of values associated with tree and grass products, a mixed system may be optimal. 

The equations outlined above suggest that the only situation where the total 
production at intermediate tree density is higher than the productivity at either 
extreme is in the case where nitrogen fixation by trees occurs, and the rain-use 
efficiency of grasses is lower than that of trees (i.e., infertile soils).

There are cases where the landowner or society requires both trees and grasses
within the same landscape. One example is in ecotourism, where a lightly treed
landscape (“parkland”) is deemed more attractive than either a densely wooded
one or a treeless one.



3.2. MULTIPLE STABLE CONFIGURATIONS 

Historically, there have been two main schools of thought regarding the ecological
dynamics of savannas (see review by Scholes and Archer 1996). At present, the
schools are tending to converge on a hybrid concept. Both “traditional viewpoints”,
here contrasted for clarity, set out to explain the observed widespread coexistence
of tree-grass mixtures. The “equilibrium school” posits that the competition
between tree and grasses is balanced or limited - for instance by an even stronger
competitive effect of trees on trees, or by an exclusive niche enjoyed by the
otherwise weaker competitor. The “disturbance school” accepts that trees tend
to out-compete grasses, but proposes that fire (whose intensity is dependent on
grass abundance) acts to keep the trees from completely dominating. The synthesis
position argues that a comprehensive model of savanna dynamics requires both a
competition mechanism and a disturbance mechanism.

Graphical analysis of the “state space” defined by the quantity of woody
biomass (W) on one axis, and grass biomass (G) on the other axis, has been widely
used to illustrate the existence of stable configurations (Walker, Ludwig, Holling
and Peterman 1981; Walker and Noy-Meir 1982). The isoclines where ΔG = 0 and
ΔW = 0 (also called “nullclines”) are plotted into this space, and their intersections
with one another and with the axes are interpreted as stable or unstable. A recent
analysis of resilience in savannas, based on a different model to the one proposed
here but reaching similar conclusions, is given by Anderies et al. 2002.

The arguments of neither school are dependent on non-linearities, but the presence
of non-linearities of the types described makes certain outcomes more likely.
For instance, following the “equilibrium school” it is theoretically possible to have
a stable tree-grass mixture in a situation where there is a linear relation between
grass production and tree amount. It is also possible to postulate a linear system
with two stable end-points (complete domination by trees or treeless) separated by
an unstable saddle-point. It is not possible, with purely linear forms, to have the
most widely-observed situation, which is alternate configurations of the mixture
in which both trees and grasses persist, though perhaps in small amounts (Figure 3).
The non-linear effect of trees on grasses described by the equations in the first part
of this paper allows the grass biomass to be severely reduced by tree biomass, but
never completely eliminated from the mixture.

The production equations presented above make no prediction of an effect of 
grasses on trees. The empirical evidence on this topic is mixed: in an experiment 
where the grass cover was physically removed, tree growth was enhanced in one
vegetation type but not another (Knoop and Walker 1984). It is clear that the inter- 
action is highly asymmetric: trees have a strong competitive effect on grasses, but 
grasses have a weak competitive effect on mature trees, although they may have a 
strong effect on saplings that have not grown above the grass layer. 

The system of production equations presented above translates into the diagram 
shown in Figure 4, with suitable flux-to-stock conversions. The wood nullcline 
is a vertical straight line, since wood biomass is unaffected by grass biomass. 
This system has only one stable configuration: a tree-grass mixture, typically at 
a relatively high tree biomass and low grass biomass. 

A quantitative analysis of the predictions of the disturbance school requires the
introduction of an equation predicting the effect of grass biomass, via fire intensity,
on tree biomass. A simple analysis is that fire intensity is directly proportional to
the fuel load in the fire season, which is predominantly dry grass. Fire weather also
influences intensity (Trollope and Potgieter 1985), but it is relatively consistent
during the main burning season. Fire in savannas seldom kills trees (or grasses),
but if the tree crown is in the flame zone, the aboveground parts are killed and must
resprout from ground level. The critical threshold for recruitment of trees, and thus
the long-term growth of the tree biomass, is whether the tree can grow beyond the
flame height in the interval between fires. Flame height is an exponential function
of fire intensity (van Wilgen 1986). For many semi-arid savannas, with a fire return
interval of three to five years, it is empirically observed that a fire intensity in
excess of 3000 kW/m/s is required to suppress trees, which translates to a fuel load
of around 300 g/m² under average burning conditions and a flame length of about
3 m. Introducing this simple relation into the graphical analysis results in Figure 5,
in which two configurations are possible provided that the fuel accumulation over
the mean inter-fire interval exceeds the 300 g/m² threshold. The first is the same
high tree biomass-low grass biomass mixture predicted with competition alone,
and the second has high grass biomass but no trees. The two are separated by an
unstable “saddle point”: trajectories to the left of this point tend to the treeless state
(referred to as the “fire trap”).

In practice, the “treeless” configuration usually has a scattering of mature trees
that became established, often as a cohort, during a period when the system did not
exceed the killing threshold. This can occur, for instance, during a prolonged period
of below-average rainfall. In addition, there are typically many suppressed small
trees within the grass layer, some of which may be decades old. The resilience
of this “open savanna” configuration can be measured as the distance between
the treeless stable point and the unstable saddle-point. The effect of strong
non-linearity in the effect of trees on grass production is to shift the saddle point to
the left, reducing the resilience of the treeless configuration, and increasing the
resilience of the high tree - low grass biomass configuration.

Figure 3. Graphical analysis of “nullclines” in the woody (W) versus grassy (G) state space
is a widely used technique for analysing the stability states of savannas. The woody nullcline
(dashed line) is the set of values of G at which dW/dt = 0, and the grass nullcline (solid line) the
set of values of W where dG/dt = 0. The linear case is used here to illustrate the method. Where
the tradeoffs between woody biomass and grass biomass are both linear, three qualitatively
different cases are possible. In case (a) the tree nullcline lies above the grass nullcline over
the entire range, and only one outcome is possible: total competitive exclusion of grasses by
trees. (An analogous case is theoretically possible but not illustrated, with complete exclusion
of trees by grasses.) In case (b) the woody nullcline lies above the grass nullcline at low wood
biomass, but below it at high wood biomass. This is equivalent to saying that the competitive
effect of trees on other trees is stronger than the effect of trees on grasses, and the same for
grasses. The outcome is a stable coexistence of trees and grasses at an intermediate level. Case
(c) has the grass nullcline above the wood nullcline at low W, and vice versa at high W. The
outcome is an unstable (“saddle point”) equilibrium at the intersection. If perturbed from this
point the system will trend to either a treeless or a grassless endpoint.

Figure 4. The state-space analysis that would result from the production equations described
in this paper would show a strong, convex effect of woody biomass on the grass nullcline,
but no effect of grass biomass on the wood nullcline. The wood nullcline would lie at the
point where tree-on-tree competition limited further tree growth. The axes have not been
quantified because the equations described apply to production at the annual timescale, not
the instantaneous production and loss rates of the components. In reality, if the value of G
is quantified as, for instance, aboveground green biomass, it shows a strong within-season
variation from near-zero in the dry season, to a peak value in the middle of the wet season.
The values of W, if expressed as total aboveground biomass of trees, vary slowly between
years, but if expressed as tree leaf area (more meaningful for this purpose) show a strong,
bounded variation within each year.

3.3. THE DYNAMICS OF BUSH ENCROACHMENT

When the tree-grass ecological system is coupled to a human system (pastoralism), 
it is commonly observed that the ratio of trees to grasses changes, often very 
rapidly. When European ranchers colonised the savannas of Africa, Australia, 
South and North America, the formerly open, grassy configuration frequently 
switched, within thirty to fifty years, to a densely woody configuration in which 
there was insufficient grass production, given the size of the holdings, to meet 
the income requirements of the rancher. Furthermore, the dense bush increases 
the cost of mustering, and in the extreme case, makes the residual grass physi- 
cally inaccessible to the cattle. Interestingly, these problems were unknown in the 
African pastoral system which preceded the colonial period, and are rare on the 
communal grazing lands in Africa, despite the sometimes high livestock numbers 
they support. The communal lands often exhibit the opposite problem, “deforestation”,
or a reduction in tree cover to a point where it threatens the supply of
tree-based products.

Figure 5. Addition of the effect of fire, largely fuelled by dead grass, on trees allows a
state-space analysis based on production equations to be performed. It is assumed, in this
case, that a fire of intensity 3000 kW/m/s (equivalent to flames 2.7 m in length, or a fuel load
of 300 g/m²) once every three years will prevent tree recruitment. This defines the horizontal
portion of the wood nullcline. The vertical section is defined by the limit of woodiness in
the absence of fire. An equation resulting from a meta-analysis of global tree cover data in
savannas suggests this envelope to lie at Xmax = 1.2(R-300). The annual production equations
can be used to generate the grass nullcline, by calculating the fuel accumulation over the fire
interval (assuming that 60% of fuel carries over between years if not burned or eaten), as
a function of woodiness, here represented by the tree basal area. The solution depicted here
assumes 600 mm annual rainfall, 80% sand content in the soil and 20% of nitrogen being fixed
by trees. At low W, sufficient grass accumulates in three years to exceed the tree mortality
threshold. Therefore the system tends to an open state. At high W, insufficient grass fuel
accumulates to control the trees, and the system tends to a closed state with a small amount of
residual grass.

There are several key differences between the two forms of management:
ranchers generally suppressed fires, made little direct use of trees, and greatly
reduced the numbers of browsers (herbivores that eat trees) in the system. In
contrast, communal graziers deliberately set fires, harvest the trees for construction
of huts, pens and for fuel, and typically run herds of goats in conjunction with
cattle.

In terms of the simple graphical analysis, the effect of increased grazing is to
lower the position of the grass nullcline, thus reducing the resilience of the “open”
configuration, and eventually eliminating it as a possibility. The effect of long-term
fire exclusion is to remove the horizontal segment of the wood nullcline (i.e., a
return to Figure 4), which also eliminates the possibility of a stable “open”
configuration. Increased fire frequency, up to a point and in the medium term, can lower
the position of the horizontal portion of the wood nullcline, increasing the size of the
fire trap. However, over-frequent fire could, in theory, run down the nutrients in the
system or otherwise weaken the grass such that the grass nullcline is also lowered.
Browsing and wood harvest have the effect of shifting the vertical portion of the
wood nullcline to the left, meaning that the residual grass in the higher tree biomass
configuration is increased, and the resilience of the woody configuration is reduced.
Under very heavy browsing or wood harvesting the “mixture” configuration can be
eliminated. These patterns are at least consistent with observations, even though
they do not prove the hypothesis.

The ease and rapidity with which ranchers can tip the tree-grass configuration, 
and their difficulty in returning to the formerly more open state, is thus strongly 
related to the convexity of the effect of trees on grass. Following disturbance of 
the grass sward by grazers and fire suppression, small trees become established. 
The direct or indirect reduction of fire frequency and intensity and the exclusion of
browsers allow them to grow, in a space of about a decade, to the point of being
able to suppress grass production. The herd size is not reduced as the rancher begins
to feel the economic pinch, resulting in even less grass being available to carry 
a fire, and the combined pastoralist-savanna system spirals into the encroached 
condition. 

3.4. THE OPTIMAL PATTERN OF TREE CLEARING

A logical consequence of the convexity of the tree-grass relation is that if tree 
clearing is undertaken for the purpose of increasing grass production, and the 
amount of money available is insufficient to clear completely (or some trees are 
desired to be kept in the system for other reasons), the most cost-beneficial pattern 
of clearing is to remove all the trees in a portion of the landscape, rather than to
remove a portion of the trees in all of the landscape. The guideline is to begin with
the least encroached areas (rather than the usual “intuitive” practice of tackling
the most densely-treed areas first). It is generally easier to manage a “patchy
clearing” pattern, since a clear operating rule can be established, fire can be used 
as a control mechanism, and the dispersal of tree propagules into the cleared area 
is reduced. 
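
The convexity argument above can be checked with a small numerical sketch. The functional form g(w) = g_max * exp(-k * w) and its parameters are hypothetical stand-ins for a convex, decreasing effect of tree cover w on grass production; they are not the paper's production equations. The sketch also assumes that clearing cost is proportional to the amount of tree cover removed.

```python
import math


def grass_production(tree_cover, g_max=1.0, k=3.0):
    """Hypothetical convex, decreasing tree-on-grass effect: g(w) = g_max * exp(-k * w)."""
    return g_max * math.exp(-k * tree_cover)


def patch_clearing(tree_cover, share_removed):
    """Mean grass production when all trees are cleared on a share of the land."""
    cleared = share_removed * grass_production(0.0)
    untouched = (1.0 - share_removed) * grass_production(tree_cover)
    return cleared + untouched


def uniform_thinning(tree_cover, share_removed):
    """Mean grass production when the same share of trees is removed everywhere."""
    return grass_production((1.0 - share_removed) * tree_cover)


def gain_per_tree_removed(tree_cover):
    """Grass gained per unit of tree cover removed when a patch is fully cleared."""
    return (grass_production(0.0) - grass_production(tree_cover)) / tree_cover


if __name__ == "__main__":
    w, s = 0.8, 0.3
    print("patch clearing  :", round(patch_clearing(w, s), 3))
    print("uniform thinning:", round(uniform_thinning(w, s), 3))
    print("gain per tree, light vs dense:",
          round(gain_per_tree_removed(0.2), 3), round(gain_per_tree_removed(0.8), 3))
```

Both approaches remove the same quantity of trees, but for any convex g the patchy pattern yields at least as much grass (Jensen's inequality). The gain per unit of tree cover removed is also larger on lightly encroached land, which is the basis of the guideline to start there.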

Managing Systems with Non-Convex Positive Feedback

Abstract. We study here optimal management of dynamic ecological systems that exhibit a
destabilizing positive feedback. The prototype example is that of a shallow lake in which phosphorous
loading placed by anthropogenic activities (fertilizers for farming and gardening) is stored in
sediments until a critical level is reached after which there is a destabilizing return to the water - the
tradeoff between farming interests and lake quality generates an optimal control problem. We show
that in such systems, there may be a variety of local optima and associated basins of attraction
wherein the optimal path may depend on starting state (phosphorous stock). We characterize the
various possible optimal behaviors and identify the ambiguities that can only be resolved by choice
of functional form.

Key words: basins of attraction, dynamic optimization, lake ecology, non-convex dynamics, positive 
feedback 



1. Introduction 

This article gives a fairly complete theoretical treatment of a deterministic version 
of the Carpenter et al. (1999) lake management model. This model is meant to be a 
general abstraction of optimal management problems for ecosystems where there is 
a phosphorous load placed by anthropogenic activities (fertilizers for farming and 
gardening) that contributes to stock stored in a lake. When that variable becomes 
too high, it sets off an internal positive feedback mechanism which impairs the 
ecosystem’s ability to absorb and biodegrade loadings. Furthermore there are 
disamenities from degraded lake quality whose flow is proportional to the level of 
the stock variable. Here we will emphasize a fully optimizing version in which the 
manager can measure the stock and can control the loadings as a function of the stock.
He chooses these loadings to best trade off the conflicting interests of farmers and 
lake users. Although we will use this lake model as the paradigm example, there 
are a number of other examples coming from ecological studies that seem to have 
similar structure and therefore ought to have roughly identical analysis. We will 
comment further on the general applicability of our analysis in the conclusions. 

We show how methods of infinite horizon optimal control theory developed for 
problems with convex-concave dynamics of the state variable may be applied to 
the lake problem. A reasonably complete analysis turns out to be quite subtle. 

Before we begin, we must say a few words about the style of exposition we shall
use in this article. We shall be getting into problems of global analysis where there
are many possibilities to examine and where the analysis will get bogged down
quickly if we insist on writing in the completely rigorous “ε-δ” style. Hence, we
shall write in the style of global analysis where arguments which can obviously
be made rigorous will be exposited with geometrical devices like phase diagrams.
Furthermore, to avoid technical boundary conditions, we shall assume right hand
and left hand “Inada” conditions and we shall assume existence of optima to infinite
horizon control problems even though this has not been proved in all cases.

Existence theory is essentially trivial in discrete time given literature like that 
treated in Becker and Boyd (1997, see especially the references to Boyd’s work on 
existence in continuous time problems). Existence theory is more technical in the 
continuous time case but is treated in Carlson, Haurie and Leizarowitz (1991).

2. The Stylized Lake Ecology 

Following Carpenter et al. (1999), consider the following model of the dynamics
of a lake where the limnology is boiled down to a one dimensional ordinary
differential equation,

$$\frac{dx}{dt} = a - bx + f(x) = E(a, x), \qquad x(0)\ \text{given}. \tag{2.1}$$

Here, x represents the state of the system, and corresponds to the stock of phos- 
phorous suspended in algae (which we will take as proportional to undesired 
turbidity of the water in the lake), a is phosphorous inputs from the watershed 
(sometimes referred to as the “loading”), b is the rate of loss per unit stock (sedi- 
mentation, outflow, and sequestration in organisms other than algae), and f(x) is 
internal loading. This function is assumed to be “S-shaped”; that is, for low stocks 
of phosphorous, additions tend to be stored in the lake bed so there is a relatively 
low marginal return to the water, whereas for higher stocks this marginal return 
increases, only to fall again when maximal suspension is approached. This equation 
represents a “minimal state variable” approximation to the complex foodweb of a 
real lake. The reduction, together with an argument that a convex-concave “recycling
curve” f(x) is a good abstraction capturing real world positive feedback processes that are
triggered inside a typical lake when P-load becomes too high, is given by Carpenter
et al. (1999) and the references in that paper. See Cottingham and Carpenter (1994,
p. 2129) for a much more disaggregated model of the state variable dynamics which 
consists of a set of differential equations to represent the food chain of the lake. As 
we said before, one might think of (2.1) as an “aggregative reduced form” for the 
true disaggregated model. 
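
The behavior of (2.1) can be explored with a short simulation. The sketch below is an illustration under stated assumptions: the paper requires only that f be S-shaped, so the specific sigmoid f(x) = x^2 / (1 + x^2), the parameter values and the forward-Euler integrator are choices made here, not the authors'.

```python
def internal_loading(x):
    """Assumed S-shaped recycling term, f(x) = x^2 / (1 + x^2) (an illustrative choice)."""
    return x * x / (1.0 + x * x)


def stock_rate(a, x, b):
    """Right-hand side of (2.1): dx/dt = a - b*x + f(x)."""
    return a - b * x + internal_loading(x)


def simulate(x0, a, b, t_end=400.0, dt=0.01):
    """Forward-Euler integration of (2.1) under a constant loading a."""
    x = x0
    for _ in range(int(t_end / dt)):
        x += dt * stock_rate(a, x, b)
    return x


if __name__ == "__main__":
    a, b = 0.1, 0.6
    for x0 in (0.2, 0.6):
        print("x(0) =", x0, "-> long run x =", round(simulate(x0, a, b), 4))
```

For this particular f with a = 0.1 and b = 0.6, setting dx/dt = 0 gives the cubic 6x^3 - 11x^2 + 6x - 1 = (x - 1)(2x - 1)(3x - 1), so the steady states are x = 1/3, 1/2 and 1. Starting stocks below 1/2 settle at the low steady state and stocks above it settle at the high one.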

Figure 1. Lake model Case 1: unique equilibrium with constant loading.



3. Behavior under Constant Loading 

We can get some intuition for the functioning of this model and set a benchmark 
for comparing with optimal management by considering behavior under constant 
loading, that is the dynamics of equation (2.1) when the loading rate from outside 
sources (a) is held constant. Depending on the relationship between the purification 
parameter (b) and the feedback function (f(x)), various qualitative behaviors are 
possible; we distinguish three distinct cases. 

Case I: Unique equilibrium 

When the purification rate is greater than the maximal feedback rate (b > max f'(x))
there will be a unique long run equilibrium steady state for any given constant
loading rate and the steady state will be a continuous increasing function of the
loading. Furthermore, convergence to the steady state will be monotone from any
starting stock of phosphorous. Referring to Figure 1, if the loading is a and x is
above its steady state value, bx - a is larger than f(x) so dx/dt is negative and x will
fall toward the unique steady state. Similarly, the reverse dynamics hold if the initial
stock is below the steady state. Here the nonconvexity appears inessential in that
behavior is qualitatively as it would be in a fully convex version.

However, we will see that this need not be true in the optimizing version - that is 
for some combinations of parameters consistent with Case I, there will be multiple 
local optima. As we will see, this situation becomes more likely in the other cases. 

Figure 2. Lake model Case 2: 2 basins of attraction with reversibility (constant loading).



Case II: Multiple equilibria with reversibility 

When the purification rate is smaller than the maximal feedback rate, there will
be multiple long run steady states for some values of the constant loading. Case
II corresponds to the situation when, in addition, f(x) is less than bx for all x (see
Figure 2). Then for initial loadings between L1 and L2, there will be two locally
stable long run steady states, one involving a relatively low x “clear water” state
(referred to as oligotrophic in the ecology literature) which is highly valued by
the lake’s users, but also a relatively high x “turbid water” state (referred to as
eutrophic in the ecology literature) which is disliked by the lake’s users but valued
by agricultural interests because it allows them high fertilizer use. Which state is
reached for a given constant loading depends on the initial stock of phosphorous,
high initial stocks leading to a eutrophic long run equilibrium and vice versa (as
indicated by arrows in Figure 2). These two “basins of attraction” of the lake will
play a major role in the following discussion. In this situation, history matters in
that for a given constant load the long run steady state will depend on the initial
level of x. However, all long run outcomes are reversible in this case. A sufficiently
low load (below L1) will always restore the clear water type state if it is maintained
sufficiently long.

We will show that a fully optimizing version of case II can display a wide variety
of qualitative behaviors. There can be a single locally optimal path or as many as 
you like, depending on parameters. Further, we will show that history may or may 
not matter in the optimizing solution. 

Case III: Multiple equilibria with irreversibility 

If f(x) lies above bx for sufficiently high x, then behavior is as in case II except
that once the phosphorous level exceeds a certain critical value, there is no policy
that can ever again achieve lower values. Figure 3 shows this critical level of x. If the
initial stock is larger than that, then even if we reduce the external loading to zero,
the phosphorous stock will converge to a eutrophic long run steady state (at a high
value of x).

In our optimizing version there will be a unique locally optimal policy if the
initial stock is above this critical level. However, if it starts below, there may still be
multiple local optima - that is, it might be desirable to cross the threshold from below
in an optimal management scheme. We will try to show when and how this happens.

Notice that the function f(x) is the source of the potential hysteresis in all of 
these cases. A plausible mechanism to generate an effect like f(x) is the theory put 
forth in Scheffer (1997, Chapter 3 and Figures 3.19 and 3.22). Further, the model’s 
“prediction” that lakes may be seen to flip from low turbidity to high turbidity 
states in response to small changes in loading factors seems to be confirmed by 
observation. 
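
The three cases can be told apart mechanically once a functional form for f is fixed. The sketch below again assumes the illustrative sigmoid f(x) = x^2 / (1 + x^2), which is not prescribed by the paper, and applies the verbal definitions of Cases I to III on a grid of stock values.

```python
def f(x):
    """Assumed S-shaped internal loading, f(x) = x^2 / (1 + x^2)."""
    return x * x / (1.0 + x * x)


def f_prime(x):
    """Marginal feedback rate f'(x) for the assumed f."""
    return 2.0 * x / (1.0 + x * x) ** 2


def classify(b, x_max=20.0, n=20000):
    """Return 'I', 'II' or 'III' for purification rate b, following Section 3."""
    grid = [x_max * i / n for i in range(1, n + 1)]
    # Case I: purification rate exceeds the maximal feedback rate.
    if b > max(f_prime(x) for x in grid):
        return "I"
    # Case II: in addition, f(x) stays below b*x for every positive stock.
    if all(f(x) < b * x for x in grid):
        return "II"
    # Case III: f(x) reaches above b*x somewhere, so zero loading cannot restore low stocks.
    return "III"


if __name__ == "__main__":
    for b in (0.7, 0.6, 0.45):
        print("b =", b, "-> Case", classify(b))
```

For this f the maximal feedback rate is 3*sqrt(3)/8 (about 0.65, reached at x = 1/sqrt(3)) and the largest value of f(x)/x is 0.5 (at x = 1), so Case I holds for b above about 0.65, Case II for b between 0.5 and about 0.65, and Case III for b below 0.5. Other S-shaped forms would move these boundaries.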



4. An Optimizing Framework 

There is a community of individuals who share the lake and its watershed. These 
individuals have conflicting interests. Some are producers whose activities release 
loadings into the catch basin of the lake, which contribute to the term a in (2.1). 
Others are users of the services of the lake and are injured by the “dumping” 
activities of the others in that they care about the overall quality of the lake which 
we proxy by the stock x - higher stocks meaning lower quality. 

Utility (measured, for example, as increases in profitability of economic opera- 
tions) is generated for the producers of a from agriculture, lake side cottages, devel- 
opments, etc. Disutility is generated by x and must be measured by willingness to 
pay for a clean water environment. 

We formulate the objective of a Social Planner who acts in the interest of the
community as a whole as follows,

$$W(x) = \max \int_0^\infty \exp(-rt)\, U(a, x)\, dt, \quad \text{subject to (2.1)}. \tag{4.1}$$

Here the maximum is taken over the set of piecewise continuously differentiable
functions {a(t)}. We assume that the utility U(a, x) is increasing in a and is
decreasing in x. This generates a tradeoff which the social planner must optimize.
Here r > 0 is the real rate of discount on future utility. Since this rate is generally
thought to be quite small (on the order of 1-2%) we will focus attention on
situations where it is small relative to the parameter b.

The plan of this part of our paper is as follows. We shall study the optimal
solutions of (4.1) and use the first order necessary conditions of optimality to
characterize these solutions. We shall spend most of our time on the special case
U(a, x) = U_1(a) + U_2(x) because, here, we can “separate” the interests of the
Affectors (those interests like agriculture and developers whose activities damage
the resource) from the interests of the Enjoyers (those interests like fishers, viewers,
and swimmers whose activities do not injure the resource). We take the terms
“Affectors” and “Enjoyers” from the work of Scheffer et al. (1998). We shall also
spend some time on the case where U_1 is linear because it generates a particularly
simple dynamics that motivates the more general behavior. Linearity can be thought
of as representing a reduced form restricted profit function of a constant returns to
scale Affector sector. This case is mostly of pedagogical interest since with a fixed
amount of land in proximity to the lake, we would expect diminishing returns.



5. The Most Rapid Approach Special Case

Here we study the linear, separable case U_1 = Aa, U_2 = -c(x), c' > 0, c'' > 0.
For the deterministic case linear U generates a Most Rapid Approach Problem
(MRAP) where one operates the control variable a to push the state variable x to
the maximum of a certain function of x, as we shall see. Thus we now study the
problem:

$$W(x) = \max \int_0^\infty \exp(-rt)\,\{Aa - c(x)\}\, dt, \quad \text{subject to} \tag{5.3}$$

$$\frac{dx}{dt} = a - bx + f(x). \tag{5.4}$$

We assume that f(x) is convex-concave and is increasing in x with f(0) = 0, f'(0) = 0,
f'(∞) = 0, with unique x_m = argmax f', f'' > 0 for x < x_m, f'' < 0 for x > x_m. Notice
that if we define AP(x) = f(x)/x, MP(x) = f'(x), then just as in elementary economics
of increasing returns production functions, we have MP > AP (MP < AP) for x <
x_m (x > x_m) with MP = AP for x = x_m. Furthermore, AP(0) = MP(0) and AP(∞) =
MP(∞) = 0.

Now, for any finite T, substitute for “a” from (5.4), integrate the term in dx/dt by
parts, and observe that

$$\int_0^T \exp(-rt)\,\{Aa - c(x)\}\, dt = A[\exp(-rT)\,x(T) - x(0)] + \int_0^T \exp(-rt)\,\{A[rx + bx - f(x)] - c(x)\}\, dt. \tag{5.5}$$

Hence if we restrict ourselves to the set of functions x(.) that satisfy the
transversality condition:

$$\exp(-rT)\,x(T) \to 0, \quad T \to \infty, \tag{5.6}$$

then (5.5) suggests that we control a(.) to move x(t) to most rapidly approach

$$x^* = \arg\max \pi(x), \qquad \pi(x) = A[rx + bx - f(x)] - c(x). \tag{5.7}$$

Indeed, since the objective is now linear in control (a) we know from the work 
of Spence and Starrett (1975) that even if there are restrictions on the control, the 
optimum from any starting point is to most rapidly approach some local maximum 
in (5.7). We can think of this objective as trading off long run interests of Affectors 
and Enjoyers. Affectors get a flow benefit of rA per unit from the stock per unit 
time while Enjoyers pay cost c(x). In addition the stock provides a net purification 
benefit of bx - f(x) which acts like a productivity term in the objective. 

Notice that it is easy for the mathematics in (5.7) to yield “unpleasant” solutions. 
For example, if the turbidity disutility U_2(x) = -c(x) were bounded from below,
(5.7) would shoot x off to infinity and turn the lake into a dumping ground for the 
a > 0 polluting “stakeholders”. This is so because the polluters are willing to pay 
A > 0 for each extra unit of x-dumping while the willingness to pay to prevent 
such dumping embodied in the disutility is falling to zero. 

Part of the reason for this anomaly derives from our assumption of linearity 
(marginal benefit from pollution assumed not to decline in pollution level). 
However, it seems sensible to posit an unbounded cost which will assure an interior 
solution. If x(t) can be moved immediately to x*, we have from (5.5) 

$$W(x) = -Ax(0) + (1/r)\,\big[A[rx^* + bx^* - f(x^*)] - c(x^*)\big]. \tag{5.8}$$

An initially pristine lake is one with x(0) = 0. We may differentiate W to obtain
how much an innovation would be worth that increased “b” by one unit,

$$dW/db = (1/r)\,A x^*. \tag{5.9}$$

The innovation allows the lake to “biodegrade” one more unit of x per unit time.
This extra loading capacity is worth A to the polluters each period. When this
amount is capitalized over all periods at rate r we obtain (5.9). Similar sensitivity
analysis can be done for other parameters of interest.
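
A numerical check of (5.7) to (5.9) may help fix ideas. The sketch below is an illustration only: the sigmoid f(x) = x^2 / (1 + x^2), the quadratic cost c(x) = kappa * x^2, the parameter values and the grid search are all assumptions made here, chosen so that the long-run objective has a single maximum.

```python
def f(x):
    """Assumed S-shaped internal loading, f(x) = x^2 / (1 + x^2)."""
    return x * x / (1.0 + x * x)


def payoff(x, A, r, b, kappa):
    """Long-run objective from (5.7) with the assumed cost c(x) = kappa * x^2."""
    return A * ((r + b) * x - f(x)) - kappa * x * x


def mrap_target(A, r, b, kappa, x_max=5.0, n=50000):
    """Grid search for x* = argmax of the long-run objective on [0, x_max]."""
    grid = (x_max * i / n for i in range(n + 1))
    return max(grid, key=lambda x: payoff(x, A, r, b, kappa))


def value(x0, A, r, b, kappa):
    """W(x0) from (5.8), assuming x can be moved to x* immediately."""
    x_star = mrap_target(A, r, b, kappa)
    return -A * x0 + payoff(x_star, A, r, b, kappa) / r


if __name__ == "__main__":
    A, r, b, kappa = 1.0, 0.03, 0.6, 0.5
    x_star = mrap_target(A, r, b, kappa)
    h = 1e-3
    # Central finite difference of W with respect to b, to compare with (5.9).
    numeric = (value(0.0, A, r, b + h, kappa) - value(0.0, A, r, b - h, kappa)) / (2 * h)
    print("x* =", round(x_star, 3))
    print("dW/db numeric:", round(numeric, 3), " formula A*x*/r:", round(A * x_star / r, 3))
```

The finite-difference derivative agrees with A x*/r because, by the envelope theorem, the induced change in x* has no first-order effect on the maximized objective. Only the direct effect of b, which enters the objective through the term Abx, matters.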

The analysis provides more insight into the nonlinear case if we consider
constraints on the control variable that will prevent instantaneous adjustment.
Nonnegative values of a naturally prevent such downward adjustment and there
may well also be upper bounds (say a < M) which similarly prevent upward
adjustment. Then, as in Spence and Starrett (1975) we must search among local
optima of π(x). Examining this function, we see that there may well be multiple
local optima. To see this, examine the first order conditions for a maximum:

$$A[r + b - f'(x)] = c'(x). \tag{5.10}$$

Since for x > x_m, f'(·) is decreasing and c'(x) is increasing, we can make particular
choices for these functions that make the left and right sides of (5.10) equal as often
as we like (see Figure 4). (We will be somewhat more rigorous in demonstrating
similar properties in the nonlinear case.)

Figure 4. A multitude of equilibria. (Axis label: mass of phosphorous in algae in water.)

As always, generically, the equalities will alternate between local maxima and
local minima. Since we assume f'(0) = c'(0) = 0, it is easy to see that the first
intersection must correspond to a local maximum; also, since f'(∞) = 0 and c'(∞)
= ∞, the last intersection is also a local maximum. Of course, multiplicity need not
always happen for all choices of parameters and functions, but we see that
arbitrarily large numbers of local optima cannot be ruled out a priori. This observation
will carry over to the general optimization problem as we will see.
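
The alternation of local maxima and minima can be seen by scanning the first order condition (5.10) for sign changes. The sketch below assumes, purely for illustration, f(x) = x^2 / (1 + x^2) and c(x) = kappa * x^2 with hypothetical parameter values picked so that three intersections occur.

```python
def f_prime(x):
    """Marginal feedback f'(x) for the assumed f(x) = x^2 / (1 + x^2)."""
    return 2.0 * x / (1.0 + x * x) ** 2


def foc_gap(x, A, r, b, kappa):
    """Left side minus right side of (5.10), with the assumed c'(x) = 2 * kappa * x."""
    return A * (r + b - f_prime(x)) - 2.0 * kappa * x


def local_optima(A, r, b, kappa, x_max=20.0, n=20000):
    """Locate sign changes of the gap on a grid and label them as maxima or minima."""
    found = []
    prev_x, prev_gap = 0.0, foc_gap(0.0, A, r, b, kappa)
    for i in range(1, n + 1):
        x = x_max * i / n
        gap = foc_gap(x, A, r, b, kappa)
        if prev_gap > 0.0 >= gap:
            # Slope of the objective turns from positive to negative: local maximum.
            found.append((0.5 * (prev_x + x), "max"))
        elif prev_gap < 0.0 <= gap:
            # Slope turns from negative to positive: local minimum.
            found.append((0.5 * (prev_x + x), "min"))
        prev_x, prev_gap = x, gap
    return found


if __name__ == "__main__":
    for x, kind in local_optima(A=1.0, r=0.03, b=0.52, kappa=0.02):
        print(kind, "near x =", round(x, 2))
```

With these hypothetical values the scan finds a low-stock local maximum, an intermediate local minimum and a high-stock local maximum, in that order, matching the pattern described in the text. Whether multiplicity arises at all depends on the functional forms and parameters chosen.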

Let us consider the simplest generic multiplicity case where π(x) has two local
maxima α and γ separated by a local minimum β. The remaining problem is to
determine which of the local maxima is the optimal long run steady state from a
given starting position given the restrictions on controls. The global maximum may
not always be the long run optimum, as it may take too long to reach it from the
other local optimum and this matters if we discount enough.

We will not try to solve this problem completely here. However, the basic
qualitative analysis is easy to give and will carry over to our nonlinear general
case. As long as the controls are bounded so that our single state variable must
move continuously, then a simple application of the Bellman principle implies
that it must move monotonically in an optimal trajectory and the set of states for
which a particular local maximum is the long run optimum must be connected.
(The set associated with a particular local maximum will be referred to as its basin
of attraction.) Consequently, either the global maximum is always optimal or there
is a knife-edge point between the local optima such that the global optimum goes
to γ for all starting stocks to the right of the knife-edge and to α for all stocks to
the left. We refer to this knife-edge point as the Skiba point since Skiba identified and
studied it in a related renewable resource problem (see Skiba (1978)).

We turn now to the general nonlinear case where we will find a similar qualita- 
tive picture. We will try to discover what particular features of the problem lead to 
multiplicities (or lack thereof) and will discuss ways of determining Skiba points 
that result. 



6. General Analysis of the Lake Problem 

Consider the general separable-utility problem: 

W(x) = Max ∫_0^∞ exp(-rt){u(a) - Bc(x)} dt, (6.1)

subject to dx/dt = a - bx + f(x), x(0) = x.

We assume: u' > 0, u'(0) = +∞, u" < 0, c(0) = 0, c' > 0, c'(0) = 0, c" > 0. Verbally,
utility to Affectors is concave (diminishing marginal utility), costs to Enjoyers are
convex (increasing marginal cost), with B representing a parameter that determines
the relative importance of the two groups. The assumptions on f(.) are as in the
previous section.

We use optimal control theory to characterize optimal solutions. Using p as the 
current value costate variable, we set up the current value Hamiltonian: 

H(a, x, p) = u(a) - Bc(x) + p{a - bx + f(x)}.

Then the maximum principle tells us that optimal loading is set to satisfy 

a > 0 ⇒ u'(a) = -p, a = 0 ⇒ u'(a) ≤ -p, (6.2)

defining a function a(p). The Hamiltonian dynamics then are written as the coupled 
pair of differential equations: 

dx/dt = a(p) - bx + f(x), (6.3)

dp/dt = {r + b - f'(x)}p + Bc'(x). (6.4)

The equations (6.2)-(6.4) constitute necessary conditions for a solution to (6.1) in
that for the optimal policy we must be able to find a function p(.) such that these
equations are satisfied for all time.

Parenthetically we remark that the assumption u'(0) = +∞ implies that a = 0
⇒ -p = +∞, so we may ignore the strict inequality case in (6.2). We shall do this
in what follows. Also, due to this fact, it is sometimes convenient (particularly in
doing phase diagrams) to change variables and work in {a, x} space rather than {p,
x} space. This is accomplished by logarithmically differentiating (6.2) with respect
to time, obtaining:

(dp/dt)/p = (da/dt)[u"(a)/u'(a)]. (6.5) 

Then, substituting into (6.4), we get the corresponding pair of equations:

dx/dt = a - bx + f(x) ≡ X(a, x), (6.3')

da/dt = [r + b - f'(x)][u'(a)/u"(a)] - B[c'(x)/u"(a)] ≡ A(a, x, B). (6.4')

Note for future reference that ∂A/∂B = -c'(x)/u"(a) > 0.
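The change of variables from (6.3)-(6.4) to (6.3')-(6.4') can be checked numerically. The sketch below is only an illustration under assumed primitives that are not taken from the paper: u(a) = ln a, c(x) = x^2, f(x) = x^2/(1 + x^2), and the parameter values r = 0.03, b = 0.6, B = 0.4. It evaluates A(a, x, B) directly and also via the costate route (6.2), (6.4), (6.5), and the two should agree.

```python
# Illustrative primitives; these functional forms and numbers are assumptions.
r, b, B = 0.03, 0.6, 0.4

def f(x):   return x * x / (1.0 + x * x)         # feedback term f(x)
def fp(x):  return 2.0 * x / (1.0 + x * x) ** 2   # f'(x)
def up(a):  return 1.0 / a                        # u'(a) for u(a) = ln a
def upp(a): return -1.0 / (a * a)                 # u"(a)
def cp(x):  return 2.0 * x                        # c'(x) for c(x) = x^2

def X(a, x):
    """Right-hand side of (6.3')."""
    return a - b * x + f(x)

def A(a, x, B):
    """Right-hand side of (6.4')."""
    return (r + b - fp(x)) * up(a) / upp(a) - B * cp(x) / upp(a)

def da_dt_via_costate(a, x, B):
    """da/dt obtained from (6.2), (6.4) and (6.5) instead of (6.4')."""
    p = -up(a)                                    # (6.2) with a > 0
    dp_dt = (r + b - fp(x)) * p + B * cp(x)       # (6.4)
    return (dp_dt / p) * up(a) / upp(a)           # (6.5) solved for da/dt

for a, x in [(0.2, 1.0), (0.05, 0.3), (0.5, 2.0)]:
    print(a, x, X(a, x), A(a, x, B), da_dt_via_costate(a, x, B))
```

With these assumed forms, raising B raises A at every point with x > 0, which is the sign result ∂A/∂B > 0 noted in the text.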

We shall proceed to do a general analysis of this problem by carrying out the
following steps: (i) Investigate sufficient conditions for a unique steady state in
these equations and find sufficient conditions for multiple steady states. It turns
out to be easy to construct examples where there are as many steady states as
you want. (ii) Conduct a dynamic analysis using linearization around steady states,
p(x) = W'(x), and phase diagrams in order to piece together the value function W(x)
following the strategy of Skiba (1978), Brock and Dechert (1983, 1985) (exposited
in Brock and Malliaris (1989)).

7. A Unique Steady State

It turns out that there can be a unique locally optimal steady state in either case
one or case two as identified earlier (but not in case three). However the outcome
is quite parameter dependent and with particular choices, there can be multiple
steady states in any of the cases. In this section, we restrict ourselves to cases one
and two. We show first that for any parametrization consistent with these cases,
there is a breakeven B (call it B*) such that there exists a locally optimal steady
state in the eutrophic (high x) basin of attraction if and only if B is below B*. It
will then follow almost immediately that if B is above B*, there is a unique local
(and therefore global) optimal path converging to a steady state in the oligotrophic
(low x) basin of attraction. We will also see that this path is saddle-value stable. In
later sections, we will explore the behavior with multiple steady states.

To identify steady states, we look for a joint solution to the equations dx/dt =
da/dt = 0, namely:

a - bx + f(x) = 0, (7.3')

[r + b - f'(x)][u'(a)] - B[c'(x)] = 0. (7.4')

These equations define curves in a, x space where dx/dt = 0 and da/dt = 0 respec-
tively. Note that in this situation, both of these define a uniquely as a function of x
and we label these a1(x) and a2(x) for convenience. Since bx > f(x), all x > 0 in
cases one and two, a1(x) will be strictly greater than zero for all positive x in these
cases. This is the important contrast with case three for purposes of this section.
Any candidate for an optimal steady state must satisfy (7.3'), (7.4'), since (6.3'),
(6.4') are Hamiltonian necessary conditions.

Recall the following features of f(.): xm is the unique inflection point where
f" = 0. Where they exist (case two), x1 and x2 are the two interior points where

f'(x1) = f'(x2) = b.

Lemma 1:

In case two, there exists a B such that for all larger B, and any (a, x) with x > x1
and X(a, x) = 0, then A(a, x, B) > 0.

Proof: 

Examining (6.3'), (6.4'), we can see that the conditions X(a, x) = 0, A(a, x, B) > 0
are equivalent to

[r + b - f'(x)][u'(bx - f(x))] - B[c'(x)] < 0. (7.5)

Now, for x > x1, using concavity of u(.), convexity of c(.) and nonnegativity of f'(.),

[r + b - f'(x)][u'(bx - f(x))] - B[c'(x)]

< [r + b][u'(bx2 - f(x2))] - B[c'(x1)].

Hence, recalling that ∂A/∂B > 0, the conditions of the lemma are satisfied by
choosing

B = [r + b][u'(bx2 - f(x2))]/c'(x1). Q.E.D.
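The bound constructed in the proof of Lemma 1 can be evaluated for a concrete parametrization. The following sketch is an illustration only: it assumes u(a) = ln a, c(x) = x^2, f(x) = x^2/(1 + x^2), r = 0.03 and b = 0.6 (none of which come from the paper). For this f the inflection point is 1/sqrt(3), f'(x) = b has two roots, and bx > f(x) for all x > 0. The code locates x1 and x2 by bisection, forms the bound, and evaluates the left side of (7.5) on a grid to the right of x1.

```python
import math

r, b = 0.03, 0.6                                   # assumed parameter values

def f(x):  return x * x / (1.0 + x * x)            # assumed feedback term
def fp(x): return 2.0 * x / (1.0 + x * x) ** 2      # f'(x)
def up(a): return 1.0 / a                           # u'(a) for u(a) = ln a
def cp(x): return 2.0 * x                           # c'(x) for c(x) = x^2

def bisect(g, lo, hi, tol=1e-12):
    """Root of g on [lo, hi], assuming g(lo) and g(hi) differ in sign."""
    g_lo = g(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        g_mid = g(mid)
        if (g_mid > 0) == (g_lo > 0):
            lo, g_lo = mid, g_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

x_m = 1.0 / math.sqrt(3.0)                          # inflection point of this f
x1 = bisect(lambda x: fp(x) - b, 0.0, x_m)          # first point with f'(x) = b
x2 = bisect(lambda x: fp(x) - b, x_m, 5.0)          # second point with f'(x) = b

# Bound from the last line of the proof of Lemma 1.
B_bar = (r + b) * up(b * x2 - f(x2)) / cp(x1)

def lhs_75(x, B):
    """Left side of (7.5); negative means A > 0 on the dx/dt = 0 curve."""
    return (r + b - fp(x)) * up(b * x - f(x)) - B * cp(x)

worst = max(lhs_75(x1 + 0.01 * k, B_bar) for k in range(1, 2001))
print(x1, x2, B_bar, worst)
```

Under these assumptions the largest value of the left side of (7.5) on the grid is negative, as the lemma requires; the bound is typically far from tight.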



Corollary 1: 

In case one, there exists a B such that for all larger B, and any (a, x) with x > xm
and X(a, x) = 0, then A(a, x, B) > 0.

Proof:

Observe that all the steps of Lemma 1 go through for case one if xm is substituted
for both x1 and x2. Q.E.D.

The significance of lemma and corollary for us is that when B is large enough
there cannot be any candidate steady states with x bigger than x1 (case 2) or xm
(case 1). This accords with our intuition that if enjoyers matter enough the social
planner will not want to go toward the “high x” basin of attraction. Next we
establish something of the reverse for low B.

Lemma 2: 

Pick any x bigger than xm for which r + b > f'(x). Then in either case one or case
two, if a is such that X(a, x) = 0, there exists B > 0 with A(a, x, B) < 0.

Proof:

We require

[r + b - f'(x)][u'(bx - f(x))] > B[c'(x)]. (7.5)

Since the left side of (7.5) is strictly positive and c'(x) is finite, this inequality can
be satisfied for positive B. Q.E.D.

Using these lemmas, we can show that there is a breakeven relative importance
of the parties such that if the enjoyers are relatively more important there will
be a candidate steady state in the low-x basin of attraction whereas if they are
less important there will always be a candidate steady state in the high-x basin of
attraction.

Proposition 1: (note: This and later propositions to be read twice - with and
without braces)

In case 1 {case 2}, there exists a B* > 0, such that

B < B* if and only if there is a candidate steady state with x > xm {x1}. (7.6)

Proof:

We set B* to be the smallest that will satisfy corollary 1 {lemma 1}. Since B sets a
positive lower bound, there must be such a positive least upper bound. Q.E.D.

Proposition 2:

In case 1 {case 2}, if B > B*, there will exist a unique candidate steady state with
x < xm {x < x1}.

Proof: 

From Corollary 1 {Lemma 1} we know that

A(a1(xm), xm, B) > 0 {A(a1(x1), x1, B) > 0}.

Further, since a1(0) = 0, using (6.4') and the assumption c'(0) = 0, we see that
A(a1(0), 0, B) < 0.

Hence by continuity, there exists 0 < x* < xm {x1} such that A(a1(x*), x*, B) = 0.
By definitions of functions a1(.) and A(.), x* satisfies the steady state equations
(7.3'), (7.4'). Further, since a1(.) is monotone increasing and a2(.) is monotone
decreasing in x on the interval [0, xm] {[0, x1]} there can be only one such candidate
steady state in that interval. And from proposition 1, there are no such candidates
outside the interval. Q.E.D.

In the remainder of this section we focus on the situation in which there is a
unique steady state (B > B*). The interpretation of this steady state in terms of long
run optimization is very similar to what we found in the MRAP case. (7.4') tells
us that the marginal consumption plus investment benefit of the stock to Affectors
([r + b - f'(x)]u'(a)) must be set equal to the marginal cost to Enjoyers (B[c'(x)]).
The only difference now is that marginal utility to Affectors is no longer constant
and in the long run optimum, a must satisfy (7.3') in order that the steady state level
of x be maintained.

Notice that for the “Ramsey” case, r = 0, steady states (a*, x*) solve the
problem:

(a*, x*) = argmax {U(a, x), s.t. 0 = a - bx + f(x)}.

Substitute out a = bx - f(x) from the constraint into the utility to obtain

x* = argmax {U(bx - f(x), x)}.

In our case of separable utility this objective becomes u(bx - f(x)) - Bc(x) and
looks even more like a benefit minus cost objective.

Now however, we care not just about the steady state level of x but also about how
quickly we get there: the value per unit of a depends on the rate of loading so the
total benefit to Affectors depends on how loads are distributed over time. So we
turn next to transitions.

The behavior away from steady states is most easily (though not entirely
rigorously) analyzed using phase diagrams. These are constructed by plotting the
functions a1(.) and a2(.) and deducing the direction of motion in all regions of the
a, x space between these curves using (6.3') and (6.4'). In the following discussion,
refer to Figures 5 (for case 1) and 6 (for case 2). Shapes of the curves differ in the
two cases.

From (7.3') we deduce:

da1/dx = b - f'(x). (7.7)

Thus, a1(.) is monotone increasing for case one, whereas it increases to x1,
decreases from there to x2, and increases thereafter in case 2 (as we have drawn).

Turning to (7.4'), we see that (generally) in case 2, a2(.) will intersect the x axis.
This happens as long as f' is bigger than r + b on some interval (represented in the
diagrams as [x, x]). Implicitly differentiating to find shapes elsewhere, we find

da2/dx = [Bc"(x) + u'(a)f"(x)]/[(r + b - f'(x))u"(a)]. (7.8)

Outside the above mentioned interval the denominator of (7.8) is always negative.
The numerator is positive for x < xm, and also for sufficiently large x (since f" goes
to zero). However in an intermediate range, it is clearly ambiguous. Indeed, we
know that for B sufficiently close to B*, a2(.) must have a rising segment since at
B* the two curves must intersect and a1(.) is strictly positive (and strictly increasing
in case 1). Since, in addition, computational simulations generally exhibit an
increasing segment, we have drawn the diagrams in that way. This feature will
have interesting implications as we will see below.

Figure 6. Phase Diagram: (B > B*, Case 2, unique equilibrium).



We have drawn in arrows to indicate the direction of motion off the curves of
stationarity as determined by equations (6.3') and (6.4'). For example, since ∂X/∂a
> 0, we know that dx/dt is positive above a1(.) and negative below. Similarly we
find that da/dt is positive above a2(.) and negative below. Thus we get the indicated
qualitative dynamics in the four “quadrants” of our diagrams.

Examining these diagrams, we see that the unique steady state is saddle value
stable (this can be demonstrated more rigorously using linearizations, and we
will utilize that approach later). Under standard regularity conditions (cf. Hartman
(1973, See Chapter IX on Invariant Manifolds and Linearizations)), there will exist
a unique one dimensional local stable manifold of points satisfying these differen-
tial equations and converging to the steady state. Hence by running time forward
and backwards along solutions of the costate/state equations that lie on the stable
manifold near the steady state, one may construct an entire global stable manifold.
Since the costate variable p(x) must be the derivative of the value function, points
(x, p(x)) on the global stable manifold give (x, V'(x)) for a “candidate” value
function V(x).

Are there other solutions (besides that lying on the global stable manifold of 
the unique steady state depicted in Figures 5 and 6), of the costate/state equations 
that might be optimal? This question has not been definitively answered even in the 
general literature on optimal control. All we know in general is that if there is an 
optimum it must satisfy the costate/state equations for all x. However, arguments 
parallel to those in Skiba (1978) and Ekelund and Scheinkman (1986) suggest that 
we look for solutions of the costate/state equations that satisfy the transversality 
condition at infinity (TVC∞),

(TVC∞) exp(-rt)p(t)x(t) → 0, t → ∞. (7.9)

Solutions of the costate/state equations that lie on the global stable manifold satisfy
(7.9) as long as r > 0. This is because p and x converge to finite limits and the
negative exponential converges to zero. However, since transversality is not known
to be strictly necessary or sufficient here, one generally finds an auxiliary argument
to rule out other paths.

If we assume c(x) increases sufficiently rapidly relative to u(a) this will eliminate
(x(t), p(t)) such that x(t) → ∞, p(t) → 0, a(t) → ∞ by a comparison
argument. Solutions (x(t), p(t)) such that x(t) → 0, a(t) → 0, p(t) → -∞ as
t → ∞ can also be eliminated by a comparison argument provided c(0) = 0
and u(0) < 0. Hence the only candidate solution remaining in this case is the one
depicted in the figures. Consequently there is a one dimensional stable manifold 
which extends over all values of x, converging to the steady state and these paths 
constitute the optimal transition programs. As mentioned before (and now seen in 
the diagrams) the steady states must lie in the low-x basin of attraction and paths 
converge monotonically in x. What is more surprising and new is that they need not 
converge monotonically in a. One might expect that if the initial stock is too high 
relative to the long run optimum, the manager would initially set the loading quite 
low (thus lowering the stock quickly to benefit Enjoyers who experience increasing 
marginal cost) and let it rise over time to the long run optimum. 

However, this behavior is not generally optimal here, and the reason has to do 
with the shape of f(.). When x is quite large, there is saturation and lowering x 
does not give the feedback benefits it will in intermediate ranges. Thus, it may 
be optimal to start with relatively high levels of a (possibly even higher than the 
long run optimum) to benefit Affectors, and lowering these only when the region 
of maximum feedback benefit is reached. 
It is interesting to note that parameters r and B affect only the da/dt = 0 curve
which makes for easy comparative dynamics. For example, an increase in the
discount rate will raise the a2(.) curve but not affect the a1(.) curve. Consequently
the long run equilibrium level of x will rise - that is, with more impatience, the
planner worries more about the immediate benefits to Affectors and less about the
long run benefits of Enjoyers.
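The direction of these long run effects can be illustrated numerically. The sketch below is a hedged example under assumed primitives that do not come from the paper: u(a) = ln a, c(x) = x^2, f(x) = x^2/(1 + x^2), b = 0.6, and values of B large enough that only one steady state appears on the scanned range. It solves the steady state conditions (7.3'), (7.4') by scanning for sign changes and refining with bisection, then compares the steady state stock across r and B.

```python
b = 0.6                                            # assumed parameter value

def f(x):  return x * x / (1.0 + x * x)            # assumed feedback term
def fp(x): return 2.0 * x / (1.0 + x * x) ** 2      # f'(x)

def G(x, r, B):
    """(7.4') evaluated on the dx/dt = 0 curve, for u = ln a and c = x^2."""
    return (r + b - fp(x)) / (b * x - f(x)) - 2.0 * B * x

def steady_states(r, B, lo=1e-3, hi=5.0, n=5000):
    """Scan [lo, hi] for sign changes of G and refine each by bisection."""
    roots = []
    step = (hi - lo) / n
    left, g_left = lo, G(lo, r, B)
    for k in range(1, n + 1):
        right = lo + k * step
        g_right = G(right, r, B)
        if g_left * g_right < 0:
            u, v, g_u = left, right, g_left
            while v - u > 1e-12:
                mid = 0.5 * (u + v)
                g_mid = G(mid, r, B)
                if g_mid * g_u > 0:
                    u, g_u = mid, g_mid
                else:
                    v = mid
            roots.append(0.5 * (u + v))
        left, g_left = right, g_right
    return roots

base = steady_states(r=0.03, B=10.0)
impatient = steady_states(r=0.05, B=10.0)     # higher discount rate
heavier_B = steady_states(r=0.03, B=12.0)     # more weight on Enjoyers
print(base, impatient, heavier_B)
```

In this parametrization the steady state stock rises with r and falls with B, in line with the shifts of the da/dt = 0 curve described in the text. The sketch covers only the steady state; it says nothing about the transition path.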

Similarly, we can deduce comparative dynamics by considering (for example) 
a surprise increase in B due to a new surge of wealthy recreationists locating on 
the shores of the lake. This event causes the da/dt = 0 curve to fall and does not 
change the dx/dt = 0 curve. Let the lake be at the old steady state. Draw the old 
stable manifold on Figure 5 (or 6) and draw the new stable manifold which lies 
below it. The new optimal loading a drops immediately to locate on the new stable 
manifold and slowly rises as x falls to the new, lower steady state level of x. We 
italicize “rises” because one might expect “a” to keep decreasing for a while. In 
the real world there would be adjustment costs to rapid change in “a” which may 
deliver a result more consistent with intuition. 

8. Multiple Steady States 

Once we pass from the situation of a unique steady state, there generally will be 
more than one locally stable manifold satisfying all necessary conditions and trans- 
versality. Our strategy will be to use phase diagrams of the current value costate/ 
state equations to identify all solutions that satisfy the transversality condition at 
infinity. Given these, we can use the Hamilton-Jacobi property: p(x) = W'(x) to
construct a finite collection of “candidate” value functions {Vi(x), i = 1, 2, ..., I}.
The optimal value function is given by

W(x) = max_i {Vi(x)}. (8.1)

The points where the maximizing “i” changes constitute the switch (Skiba) points
that correspond to those we discussed briefly in the MRAP section. We will try
to characterize these later. Generally, each of the “i’s” above will correspond to a
locally stable steady state, so first we explore the potential multiplicity of these.
We return to an analysis of the steady state equation:

[r + b - f'(x)]u'(bx - f(x)) = Bc'(x). (8.2)

If we let R(x) represent the function on the left hand side of (8.2), then the argument
given in section 5 can be used to show that if (and only if) R(.) has a monotone
increasing segment in an interval where it is strictly positive, a c'(x) can be found
under which there are arbitrarily many solutions to this equation. We demonstrate
this somewhat more rigorously here.

Lemma 3:

Assume R'(.) is continuous and there is an x = A > 0 such that R(A) > 0 and
R'(A) > 0. Then there is a convex increasing cost function c(x) such that (8.2) has
countably many positive solutions. Under the plausible right hand and left hand
“Inada” conditions R(0) > c'(0) and there is xM such that x > xM ⇒ R(x) < c'(x),
we have (generically) R intersecting c' from above for the first steady state and R
intersecting c' from above for the last steady state. Hence, generically there are an
odd number of steady states if the number is finite.

Proof: 

Notice that the only requirement for convexity of c(x) is that c'(x) be
nondecreasing. Now R'(A) > 0 implies there is an open interval I = (A - ε, A + ε),
ε > 0 such that R(x) is strictly increasing on I. Now construct a nonnegative
nondecreasing function c'(x) on [0, ∞) such that it intersects R(x) on I a countable
number of times. This can be done because R(x) is strictly increasing on I. The
second part of the Lemma is obvious. Q.E.D.

Of course, assumptions on R(.) are not fundamental since they are not stated on
the primitives of the problem. We now seek such primitive assumptions. Note first
that R(.) is not well defined unless bx > f(x) so we restrict to cases 1 and 2 for the
moment. Further, R(.) will be positive if and only if r + b > f'(x) so our search will
be restricted to intervals where this inequality holds. Assuming only that f(.) and
u(.) are twice differentiable, R(.) is differentiable and we have:

R'(x) = [r + b - f'(x)]u"(.)[b - f'(x)] - u'(.)f"(x).



Proposition 3:

There always exist parameter choices consistent with case 2 such that there are a
countable number of candidate steady states in a neighborhood of x2.

Proof:

In case 2, r + b - f'(x2) > 0 and bx2 - f(x2) > 0, so R(x2) > 0. Further, since
f"(x2) < 0 and b = f'(x2), we see that R'(x2) = -u'(.)f"(x2) > 0. The proposition
then follows from Lemma 3. Q.E.D.

Unfortunately, we are unable to get such a clean result for cases 1 and 3. In case
1, the first term in R'(.) is always negative so the multiplicity result will depend on
the relative absolute magnitude of u" and f". For case 3, x2 lies in the interval where
(8.2) cannot have a solution and again, wherever R is positive and f" is negative,
the first term in R' is negative. Clearly, we could state sufficient conditions on f"
and u" that would guarantee multiplicities in cases 1 and 3, but we do not do so
explicitly here.

To analyze the behavior of dynamics away from steady states, we now employ
the standard linearization methodology. It is convenient to revert to the state/costate
formulation but defining Q = -p so that all variables take on nonnegative values
(recall that since the state variable is a “bad” in this problem, the natural costate is
negative). In these units, the equations of motion take the form:




dQ/dt = (r + b - f'(x))Q - Bc'(x), (8.3)

dx/dt = a(Q) - bx + f(x), (8.4)



where a(Q) is implicitly defined by u'(a) = Q.

The linearized dynamics is defined by the Jacobian coefficients:

J11 = r + b - f'(.); J12 = -Bc"(.) - u'(.)f"(.); J21 = 1/u"(.); J22 = f'(.) - b.

Let |J| denote the determinant of J. The eigenvalues are given by solutions to |J - λI|
= 0, and solving this quadratic equation, we find:

λ = [r ± √(r² - 4|J|)]/2, (8.5)

where

|J| = -r(b - f') - (b - f')² + (1/u")(Bc" + u'f").

From this we see that the sign of |J| is critical to local dynamics. If |J| is negative
then both eigenvalues are real with one greater than zero and one less than zero.
This is the case of saddle value stability where we expect a one dimensional local
stable manifold converging to the associated local steady state. On the other hand,
if |J| is positive (and r relatively small) then the roots are complex and we expect
cyclic dynamics. We will amplify on these possibilities below.
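The case distinction in (8.5) is easy to tabulate. The sketch below simply evaluates the quadratic formula for given r and |J|; the function name and the sample numbers are illustrative assumptions, not values from the paper.

```python
import cmath

def eigenvalues(r, det_j):
    """Roots of the characteristic equation, as in (8.5): trace r, determinant |J|."""
    disc = cmath.sqrt(r * r - 4.0 * det_j)
    return (r + disc) / 2.0, (r - disc) / 2.0

# |J| < 0: real roots of opposite sign (saddle).
saddle = eigenvalues(0.03, -0.07)
# |J| > 0 and r small: complex pair with positive real part r/2 (unstable spiral).
spiral = eigenvalues(0.03, 0.03)
# |J| > 0 but smaller than r^2/4: two positive real roots (unstable node).
node = eigenvalues(0.03, 0.0001)

for name, pair in [("saddle", saddle), ("spiral", spiral), ("node", node)]:
    print(name, pair)
```

Because the two roots always sum to r > 0, at least one of them has positive real part, so a steady state of these dynamics can be a saddle or totally unstable but never locally stable in both directions.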

It turns out that for strictly concave u(.), there is a very simple characterization
of the sign of |J|. Let the dx/dt = 0 and dQ/dt = 0 curves be represented by Q1(x)
and Q2(x) respectively. Then we have:

Lemma 4:

Assuming u" < 0, evaluated at any steady state, Sign {|J|} = Sign {dQ1/dx -
dQ2/dx}.

Proof:

From (8.3) and (8.4) we derive:

dQ1/dx - dQ2/dx = (b - f')u" - Bc"/(r + b - f') - Bc'f"/[(r + b - f')²]. (8.6)

Consequently, since u" < 0 and at any steady state, r + b - f' > 0,

Sign {dQ1/dx - dQ2/dx} = Sign {-(b - f')(r + b - f') + Bc"/u" + Bc'f"/[(r + b - f')u"]}. (8.7)

Then, substituting for Bc'/(r + b - f') from the steady state condition (8.2) and
comparing with the formula for |J|, the lemma follows. Q.E.D.

From this lemma, we can infer the following:

Proposition 4:

For strictly concave u(.), candidate steady states generically will alternate between
two types, type 1 where there will exist a one dimensional local stable manifold
converging to the steady state and type 2 which are totally unstable. Further the
first and last steady states (assuming there are more than one) are of type 1.

Proof:

Generically (8.6) will be nonzero and the Sign will alternate between positive and
negative at successive intersections. Therefore, from Lemma 4, |J| will alternate in
sign at successive steady states. When |J| is negative we know we are at a type 1
steady state. When |J| is positive, we can see from (8.5) that either the eigenvalues
are real in which case they must both be positive, or they are complex with positive
real part - in either case totally unstable. By our assumptions on u(.), f(.) and c(.),
Q1(0) = ∞ and Q2(0) < ∞, so the first intersection is of type 1 and since we already
established that the number of candidate steady states is generically odd, the last
intersection must be of type 1 as well. Q.E.D.
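The alternation described in Proposition 4 can be seen in a small numerical example. The sketch below rests on assumptions that are not from the paper: u(a) = ln a, c(x) = x^2, f(x) = x^2/(1 + x^2), and r = 0.03, b = 0.6, B = 0.4, chosen because they produce three candidate steady states. It solves (8.2) on a grid and classifies each root by the sign of |J|, computed as J11 J22 - J12 J21 from the Jacobian coefficients of section 8.

```python
r, b, B = 0.03, 0.6, 0.4                           # assumed parameter values

def f(x):   return x * x / (1.0 + x * x)                       # assumed f
def fp(x):  return 2.0 * x / (1.0 + x * x) ** 2                 # f'(x)
def fpp(x): return (2.0 - 6.0 * x * x) / (1.0 + x * x) ** 3     # f"(x)

def a1(x):
    """Loading that holds the stock constant: the dx/dt = 0 curve."""
    return b * x - f(x)

def G(x):
    """R(x) - B c'(x) from (8.2), with u(a) = ln a and c(x) = x^2."""
    return (r + b - fp(x)) / a1(x) - 2.0 * B * x

def det_j(x):
    """|J| = J11*J22 - J12*J21 at a steady state with stock x."""
    a = a1(x)
    u1, u2 = 1.0 / a, -1.0 / (a * a)               # u'(a), u"(a)
    return -(b - fp(x)) * (r + b - fp(x)) + (2.0 * B + u1 * fpp(x)) / u2

def steady_states(lo=1e-3, hi=5.0, n=5000):
    """Scan [lo, hi] for sign changes of G and refine each by bisection."""
    roots, step = [], (hi - lo) / n
    left, g_left = lo, G(lo)
    for k in range(1, n + 1):
        right = lo + k * step
        g_right = G(right)
        if g_left * g_right < 0:
            u, v, g_u = left, right, g_left
            while v - u > 1e-12:
                mid = 0.5 * (u + v)
                g_mid = G(mid)
                if g_mid * g_u > 0:
                    u, g_u = mid, g_mid
                else:
                    v = mid
            roots.append(0.5 * (u + v))
        left, g_left = right, g_right
    return roots

roots = steady_states()
kinds = ["type 1 (saddle)" if det_j(x) < 0 else "type 2 (unstable)" for x in roots]
for x, kind in zip(roots, kinds):
    print(round(x, 4), round(det_j(x), 4), kind)
```

For this parametrization the scan finds an odd number of candidates, with type 1 at the lowest and highest stock and type 2 in between. Other parameter values can give a single steady state, so the numbers here should be read as one instance rather than a general result.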

Note that there is some remaining ambiguity concerning the qualitative behavior 
at type two intersections. If the interest rate is sufficiently small, we can see from 
(8.5) that eigenvalues will be complex and the dynamics must be an unstable spiral. 
However, unless r is actually zero, there is the possibility that roots will be real 
(both positive). We comment on this further when constructing phase diagrams. 

The reader might think there could be limit cycles in a neighborhood of the 
unstable steady states. However, it can be shown using Green’s theorem for integra- 
tion in the plane that such cycles cannot occur in problems of this type as long as r 
is greater than zero. See, for example, Brock and Scheinkman (1977) and Tahvonen
and Salo (1996).

We do not attempt a treatment here of the case r = 0. For that case, there are
open questions about the existence of optima and one must in any case resort to
an overtaking criterion since the objective function integral may well be unbounded.
There are sufficiency theorems for the overtaking criterion, but they do not apply to
nonconvex problems such as ours.

Once there are multiple candidate steady states, there are almost certain to be
multiple solutions to the necessary conditions that satisfy transversality, at least 
for some values of the state variable. Each of these can be thought of as a local 
maximum (or local minimum) of the value function W(.). The main tool that we 
have for choosing the global maximum among these is a generalization of that 
used by Skiba (1978). It is based on use of the Hamilton-Jacobi equation for time
autonomous problems:

Max_a H(x, Q(x), a) = rW(x), (8.8)

where Q(x) represents the optimal stable manifold. Thus, for any candidate locally
optimal trajectory Qi(x), ai(x), we can associate a candidate value function Vi(x) =
H(x, Qi(x), ai(x))/r and the global optimum starting at x will be defined by i(x) =
arg max Vi(x).
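Once candidate value functions are available, taking the upper envelope and locating the point where the maximizing index changes is mechanical. The sketch below uses two made-up candidate functions, V_low and V_high, purely as placeholders; they are not derived from the lake model and only stand in for candidates obtained from two stable manifolds.

```python
def switch_point(v_i, v_j, lo, hi, tol=1e-10):
    """Bisection for the x in [lo, hi] where v_i(x) = v_j(x).

    Returns None if the difference has the same sign at both ends, in which
    case one candidate dominates on the whole interval (no switch point).
    """
    d_lo = v_i(lo) - v_j(lo)
    d_hi = v_i(hi) - v_j(hi)
    if d_lo * d_hi > 0:
        return None
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        d_mid = v_i(mid) - v_j(mid)
        if d_mid * d_lo > 0:
            lo, d_lo = mid, d_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical candidates: V_low is best at low stocks, V_high at high stocks.
def V_low(x):  return -x * x
def V_high(x): return -1.0 - 0.5 * (x - 2.0) ** 2

def W(x):
    """Upper envelope of the candidates, as in (8.1)."""
    return max(V_low(x), V_high(x))

S = switch_point(V_low, V_high, 0.0, 3.0)
print(S, W(0.5), W(2.5))
```

In practice the hard part is computing the candidates themselves, which requires tracing each stable manifold; this sketch only covers the final comparison step.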
Proposition 5: (Candidate Value Function Comparison)

Consider two candidate value functions Vi(x) and Vj(x). Suppose

Qi(x)dxi(x)/dt ≥ Qj(x)dxi(x)/dt (8.9)

(which obviously holds if dxi/dt = 0), then Vj(x) ≥ Vi(x) with strict inequality if
u(a) is strictly concave at aj or (8.9) holds with strict inequality.

Proof: 

Evaluated at any chosen value of the state variable, 

r(Vj - Vi) = u(aj) - u(ai) - Qj dxj/dt + Qi dxi/dt

≥ u(aj) - u(ai) - Qj {dxj/dt - dxi/dt}

= u(aj) - u(ai) - u'(aj){aj - ai}

≥ 0,

where the first inequality follows from assumption (8.9) and the second from
concavity of u(.). If (8.9) is strict, the first inequality is strict whereas the second is
strict if u(.) is strictly concave at aj. Q.E.D.
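A single numerical instance may help in reading the comparison in Proposition 5. The sketch below is an illustration under assumed primitives that are not from the paper: u(a) = ln a, c(x) = x^2, f(x) = x^2/(1 + x^2), r = 0.03, b = 0.6, B = 0.4. It picks one stock level and two hypothetical loadings, a_i = 0.08 and a_j = 0.05, builds the candidate values V = H/r with Q = u'(a), and checks condition (8.9) alongside the ranking of the two values.

```python
import math

r, b, B = 0.03, 0.6, 0.4                       # assumed parameter values

def f(x):
    return x * x / (1.0 + x * x)               # assumed feedback term

def candidate(x, a):
    """Return (V, Q, dx/dt) for loading a at stock x, with V = H/r as in (8.8)."""
    Q = 1.0 / a                                # Q = u'(a) for u(a) = ln a
    dx_dt = a - b * x + f(x)                   # state equation
    H = math.log(a) - B * x * x - Q * dx_dt    # Hamiltonian with costate p = -Q
    return H / r, Q, dx_dt

x = 1.0
V_i, Q_i, dx_i = candidate(x, 0.08)            # hypothetical candidate i
V_j, Q_j, dx_j = candidate(x, 0.05)            # hypothetical candidate j

condition_89 = Q_i * dx_i >= Q_j * dx_i        # assumption (8.9)
print(condition_89, V_i, V_j, V_j - V_i)
```

Here candidate i has the lower costate and a falling stock, the configuration used in the proof of Proposition 6, so (8.9) holds and V_j exceeds V_i. The two loadings are arbitrary test values, not points on actual stable manifolds.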

9. Three Candidate Steady States

We know that the simplest generic multiplicity situation will involve three
candidate steady states so we take up this situation first. Three candidate steady
states can arise in all three of our cases and as in section 5, we label these states
α, β and γ. We give a relatively complete discussion of case II first and then
indicate modifications that must be made in the other cases. There are two types 
of ambiguity in the qualitative features of phase diagrams that cannot be resolved 
without further assumptions on functional forms. The first involves the sign of the 
slope of the stable eigenspace at the largest steady state and the second involves 
whether or not the two stable manifolds extend through the entire state space. 

The first is illustrated by a comparison of Figures 7a-b which depict local
dynamics around the largest equilibrium (we draw these in a, x space for visual
convenience). Figure 7a applies when the largest equilibrium occurs in a region
where da/dt = 0 is decreasing whereas Figure 7b applies if that equilibrium occurs
where it is increasing. The operational distinction between these situations involves
whether the optimal loading will be a monotone increasing or decreasing function
of the phosphorus stock in this region.

The second ambiguity is illustrated by comparing Figures 8a-c. For Case II
we can always associate the lowest equilibrium with an oligotrophic state and the
highest with a eutrophic state since the lowest occurs below x and the highest above
X. In Figure 8a the oligotrophic stable manifold extends over the entire state space,
in Figure 8c, the eutrophic stable manifold extends back to the lowest states and in
8b, both manifolds originate from the unstable equilibrium. Using Proposition 5,
we can now state the following:

Figure 7a. Phase diagram: da/dt = 0 curve decreasing at steady state.




Proposition 6: 

(1) If Figure 8a applies, the oligotrophic state is long run optimal from any starting
state. (2) If Figure 8c applies, the eutrophic state is long run optimal from any
starting state. (3) If Figure 8b applies, there is a Skiba point (S) in the interval [.]
such that the oligotrophic state is long run optimal from starting points below S
and the eutrophic state is optimal for starting points above.

Proof: 

To establish (1) apply proposition 5 at any state greater than or equal to γ with i
representing the eutrophic manifold and j the oligotrophic manifold (Qi < Qj and
dxi/dt < 0) and use the connectedness property of optimal paths. To establish (2) apply
the proposition to any state less than or equal to α with the reverse orientation.
Statement (3) is obvious since the oligotrophic manifold must be optimal to the
left of [.], the eutrophic manifold to the right and there can be only one switch
point. Q.E.D.

Figure 8. 8a: Phase diagram: oligotrophic steady state α is optimal. 8b: Phase diagram: oligo-
trophic steady state α is optimal for initial x0 below S; γ is optimal for initial x0 greater than
S. 8c: Phase diagram: eutrophic steady state γ is optimal.

Actually, we can show that for Figure 8b, the Skiba point is strictly inside the
interval [.] if u(.) is strictly concave. This is because we can use Proposition 5 in
the same way as above to show that the oligotrophic manifold is strictly optimal at
the left point of the interval and the eutrophic manifold similarly strictly optimal at
the right point.

Obviously, it is quite important for policy to know which of these three situa-
tions applies. We can say using a continuity argument that if B is sufficiently close to
B* (from below) then situation 8a must apply since at B* the oligotrophic manifold
is strictly optimal. Beyond this, we are unable to say anything definitive without
knowing more about functional form.

We comment briefly on modifications for cases I and III. The story is pretty
much unchanged for case I except that there is no obvious way of locating the posi-
tions of α and γ relative to xm, except for B close to B* where again a continuity
argument will guarantee that α is less than xm. For case III, the dx/dt = 0 curve
cuts the horizontal axis so situation 8a cannot hold. This makes sense since if the
phosphorus level is sufficiently high we know there is no policy that will return
us to the oligotrophic state. Generically, there is no unstable steady state since both
the dx/dt = 0 and the da/dt = 0 curves cut the horizontal axis in the region where it
would ordinarily be found. This leaves the question of if and where we will find a
Skiba point in situations where there are two locally stable equilibria. It turns out
that this point (if it exists) is always strictly interior to the region where return to
the clean steady state is possible, implying that if we are sufficiently close to the
boundary, it is optimal to cross the point of no return and converge ultimately to
the dirty steady state. This can be seen with the aid of Figure 9, where the situation
with two locally stable steady states is drawn.

The locally stable clean steady state manifold must cut the x axis to the left of
breakeven point x since it would have to be locally vertical anywhere it crossed the
dx/dt = 0 line. Then, since an optimal path must exist from any starting state, the
other locally stable manifold must extend at least that far into the region to the left
of x. As drawn, there is a Skiba interval and the Skiba point must lie somewhere in
the interior of that interval. The reader should also see that it is possible to draw the
diagram so that the dirty steady state is the optimal destination starting from any
initial stock (so there would not be a Skiba point). Presumably this last situation
will apply when B is sufficiently small so that relatively little weight is put on the
preferences of enjoyers.

Figure 9. Phase diagram: Case 3 with two steady states, one Skiba point.

10. Five or More Candidate Steady States 

The situation with more than three multiplicities can be analyzed in much the same 
way as when there are three, using proposition 5. In a sense (except in case III), all 
possibilities can occur. That is (for five candidate steady states) there can be zero, 
one or two Skiba points. When there are zero, any of the three locally stable mani- 
folds can be the global optimum and when there is one, it can occur between the 
first and second stable candidates or between the second and third and any pair of 
manifolds can have basins of attraction. We illustrate in Figures 10a-c by drawing 
the dynamics (this time for case I) in which the middle locally optimal steady state 
is always long run optimal (10a), where the first and third have basins of attraction 
(10b) and where all locally stable steady states have a basin of attraction (10c). We 
leave it to the reader to fill in other cases. 

One could proceed formally by induction to analyze seven or more candidate 
steady states and show that again, all combinations remain possible but we leave 
this exercise to the sufficiently interested reader. 

11. Conclusions 

In this paper, we have characterized the types of behavior that can be optimal in 
the lake model of Carpenter et al. (1999) and shown how these behaviors change 
as functions of key parameters such as the relative weight put on affected groups. 
However, particularly in cases where there are multiple candidate steady states we 
are unable to resolve some key ambiguities such as the existence and location of 
Skiba points and the resulting sizes of basins of attraction. Unfortunately, these 
ambiguities seem to be characteristic of the class of nonconvex problems of which 
ours is an example. When there are multiple local optima, it is rarely possible to 
choose the best among them except by direct numerical comparison. 

A second ambiguity that we have uncovered involves the number of candidate 
steady states. Drawing intuition from the constant loading model, we might expect 
at most two basins of attraction and it is not clear whether the possibility of more 
than two is likely to be observed. We know that for some standard functional forms 
for f(.) and c(.) there will not be more than two such basins but whether such 
behavior is robust is an open question. 

Although we couched our analysis in terms of a model of lake eutrophication, 
there are many other areas where similar behavior exists at the stylized level treated 
here. What is required is a destabilizing positive feedback mechanism that gener- 
ates multiple basins of attraction in state space. Examples include (1) fisheries and 
other renewable resource dynamics (Clark 1990) wherein stock reductions due to 
fishing reduce stock and reproduction rates generating a destabilizing feedback 
on fishing stock, (2) dynamics of climate change (Keller et al. 2001; Deutsch 
et al. 2002) wherein global warming by introducing cold water into the North 
Atlantic (from ice melt) inhibits the gulf stream flow of warm water generating 
a destabilizing further cooling effect. Further potential examples include the inva- 
sion of woody shrubs into grassland savannahs (Walker et al. 1969), and other 
eutrophying processes in fjords and rivers (Naevdal 2001). Climate change is a 
particularly interesting case in view of the debates over potential collapse of the 
North Atlantic Thermohaline Circulation (Deutsch et al. 2001), a basin of attraction 
in which Europe enters a new ice age! 

Indeed the lake example studied in this paper can serve as a sort of mathematical 
metaphor for the problem of optimal management of any dynamical system where 
there are thresholds, destabilizing feedback and potential alternative stable states, 
some of which are much better than others from a welfare point of view. Although 
our model is too stylized to describe any of these situations precisely, we think 
it can provide insight into expected qualitative behaviors. Management of such 
systems becomes especially treacherous when there is a slow variable that acts as 
a bifurcation parameter for the faster moving dynamics studied in this paper. The 
maximum value of the feedback response rate (f′) plays such a role in our lake 
model. It is roughly governed by the amount of sedimented phosphorus in the 
lake; as this increases, max f′ passes through the cleansing parameter value (b) and 
a bifurcation occurs wherein new basins of attraction suddenly appear. Ludwig, 
Carpenter and Brock (2002) studied a slow/fast dynamical system model of lake 
eutrophication by mostly numerical methods and observed such a bifurcation. An 
important problem for future research is to get analytical results for such systems. 

Another important problem for future research is the study of optimal manage- 
ment when there is uncertainty about which model drives the dynamical system 
and whether there is a slow variable present that can create alternative stable states 
in the faster dynamics acting at the frequency typically considered by managers 
driven by economic incentives. 

Notes 

1. It is interesting to note that a concave version of this problem for which a sufficiency theorem 
holds requires that f be convex rather than concave. The reason is that the state variable here is a 
“bad” which reverses sign conventions. Of course, when this theorem holds there can be only one 
local optimum satisfying transversality and indeed with f(.) convex, there is only one solution to 
(5.10). However, the nonconvex element is fundamental to our problem and a concave segment 
of f(.) is well established empirically. 

2. This must be possible since we assumed that f (x) goes to zero in x. 

3. This proposition can be generalized; it holds for any time autonomous control problem for which 
the accumulation equation is linear and separable in the control variable, and the felicity function 
is concave in the control for each fixed value of the state variable. 

4. We have drawn these consistent with Figure 7a but the same analysis applies to Figure 7b. 

Abstract. Ecological systems such as shallow lakes are usually non-linear and display discontinui- 
ties and hysteresis in their behaviour. These systems often also provide conflicting services as a 
resource and a waste sink. This implies that the economic analysis of these systems requires solving 
a non-standard optimal control problem or, in the case of a common property resource, a non-standard 
differential game. This paper provides the optimal management solution and the open-loop Nash 
equilibrium for a dynamic economic analysis of the model for a shallow lake. It also investigates 
whether it is possible to induce optimal management in the case of common use of the lake, by means 
of a tax. Finally, some remarks are made on the feedback Nash equilibrium. 

Key words: ecological systems, non-linear differential games 



1. Introduction 

The purpose of this paper is to develop an economic analysis of the shallow lake. 
Lakes have been studied intensively and the shallow lake model is well tested and 
documented, so that the analysis has a direct meaning. However, the lake model 
can also be viewed as a metaphor for many of the ecological problems facing 
mankind today, so that the analysis developed in this paper will have a wider 
applicability. The economic analysis is especially challenging because of the non- 
linear dynamics of the lake (which yields non-convex decision problems) and the 
gaming aspects related to the common property character of the lake. 

It has been observed that shallow lakes, due to a heavy use of fertilizers on 
surrounding land and an increased inflow of waste water from human settlements 
and industries, at some point tend to flip from a clear state to a turbid state with a 
greenish look caused by a dominance of phytoplankton (Carpenter and Cottingham 
1997; Scheffer 1997). The release of nutrients, especially phosphorus, into the 
lake stimulates the growth of phytoplankton and, in addition, the resulting 
turbidity prevents light from reaching the bottom of the lake so that submerged 
vegetation disappears. With the vegetation many species disappear, such as 
waterfleas, which graze on phytoplankton. It has also been observed that shallow 
lakes are hard to restore in the sense that the nutrient loads have to be reduced 
below the level where the flip occurred before the lake flips back to a clear state. 
The positive feedback through the effect on the submerged vegetation is one 
explanation for this so-called hysteresis effect. 

* This research was initiated at a meeting of the Resilience Network which had financial support 
from the MacArthur Foundation. We are very grateful for the advice and comments of (in alpha- 
betical order) William Brock, Steve Carpenter, Davis Dechert, Marten Scheffer, Perry Shapiro, Sjak 
Smulders, Robert Solow, David Starrett, Florian Wagener and the referees. 

Ecological systems often display discontinuities in the equilibrium states of the 
system over time. A seminal paper in this area models the sudden outbreak of an 
insect, called the spruce budworm, and the long time it takes before the budworm 
density jumps back to a low number again (Ludwig, Jones and Holling 1978). 
Technically, this hysteresis effect can be modelled by a non-linear differential 
equation which has multiple steady-states with separated domains of attraction in 
a certain range of the exogenous variable. Other examples of ecological systems 
with hysteresis, including the lake model, are described in Ludwig, Walker 
and Holling (1997). 

In the ecological literature, management of shallow lakes is mostly interpreted 
as preventing the lake from flipping or, if it flips, as restoring the lake to its 
original state. However, this approach denies the economics of the problem in the 
sense of the trade-offs between the utility of the agricultural activities, which are 
responsible for the release of phosphorus, and the utility of a clear lake. When the 
lake flips to a green turbid state, the value of the ecological services of the lake 
(e.g., the intake of water and recreation) decreases but this situation corresponds 
to a high level of agricultural activities. It depends, of course, on the relative 
weight attached to these welfare components whether it is better to keep the lake 
clear or not. Note that if it is better to keep the lake clear, it is very costly to let 
the lake flip first because of the hysteresis. A second economic issue is that lakes 
are usually common property resources and therefore suffer from sub-optimal use, 
in the absence of coordination. 

The literature on the lake model is rapidly increasing. Carpenter, Ludwig and 
Brock (1999) focus on hysteresis and irreversibility issues. The paper that comes 
closest to this one is by Brock and Starrett (1999). They consider the dynamics and 
the optimal management of the lake and point out the occurrence of saddle-point 
stable steady-states and Skiba points. This paper extends their analysis to Nash 
equilibria, for the game of common property, and to tax policies with the aim to 
internalize the externalities (see below). Brock and de Zeeuw (2002) consider a 
repeated game version of the lake model. They show that the occurrence of “bad” 
Nash equilibria can in fact be beneficial because with these points as threats in 
trigger strategies, cooperation can be sustained for lower values of the discount 
factor. 

In the first part of the paper, a very simple welfare analysis is done on the possible 
steady-states of the lake model. Relative weights are chosen such that it is optimal 
to manage the lake in one of its clear states, called oligotrophic states. It is shown, 
however, that when the lake is shared by more than one community, two Nash 
equilibria occur: one in an oligotrophic state and one in a dirty state, called a 
eutrophic state. In the second part of the paper, intertemporal welfare is maximized 
subject to the dynamics of the lake. It is shown that if the discount rate is low 
enough, an optimal path for phosphorus loadings exists, from each initial condition 
of the lake, which moves the lake towards its optimal steady-state. When the lake 
is shared by more than one community, a non-linear differential game has to be 
solved. The phase-diagram for the open-loop Nash equilibrium has three steady- 
states, two of which are saddle-point stable and correspond to the Nash equilibria 
found in the first part of the paper. The third point is unstable and displays complex 
dynamics. However, it is shown that a so-called Skiba point exists which splits the 
possible initial conditions of the lake into an area from where the Nash equilibrium 
loading trajectory will approach the oligotrophic saddle-point, and an area from 
where the eutrophic saddle-point results. 

The question arises whether it is possible, by levying a tax on the loading of 
phosphorus, to induce the communities to follow an optimal management path. 
Note that if the communities are locked in the eutrophic Nash equilibrium, a 
straight path to the optimal steady-state is not feasible due to the hysteresis. 
Assuming that it is not possible to implement a time-varying tax, the answer 
depends on the number of communities. It is shown that if the number is low 
enough, a constant tax yields a Nash equilibrium path that moves towards the 
optimal steady-state (although this path will not be the same as the optimal manage- 
ment path, of course). If the number is high, however, more saddle-points arise 
again and the dynamics becomes very complex, so that there is no guarantee that a 
constant tax can induce optimal management of the lake in the long run. 

A final issue regards the type of Nash equilibrium employed in the analysis. It 
is well-known that the open-loop Nash equilibrium is not strongly time-consistent 
and therefore a feedback Nash equilibrium is preferred. However, due to the non- 
linear dynamics of the lake, it is very difficult to find a feedback Nash equilibrium. 
In the last section of the paper, some preliminary remarks are made on this issue. 
The problem is a one-dimensional infinite horizon differential game, so that the 
techniques developed by Tsutsui and Mino (1990) for dynamic duopolies with 
sticky prices, may apply. This would imply the occurrence of multiple equilibria, 
possibly with welfare levels close to optimal management. The complete analysis 
is left for further research. 

The paper is organized as follows. Section 2 describes the shallow lake model. 
Section 3 is concerned with the economics of the lake steady-states and section 4 
with the dynamic welfare analysis of the lake. Section 4 contains the case of 
optimal management, the open-loop Nash equilibrium, the effect of taxes and the 
feedback Nash equilibrium. Section 5 concludes the paper. 

2. The Lake Model 



Shallow lakes have been studied intensively over the last two decades and it 
has been shown that the essential dynamics of the eutrophication process can be 
modelled by the differential equation 

$$\dot P(t) = L(t) - sP(t) + \frac{rP^2(t)}{P^2(t) + m^2}, \qquad P(0) = P_0, \tag{1}$$



where P is the amount of phosphorus in algae, L is the input of phosphorus (the 
“loading”), s is the rate of loss consisting of sedimentation, outflow and sequestra- 
tion in other biomass, r is the maximum rate of internal loading and m is the anoxic 
level (for an extensive treatment of the lake model, see Carpenter and Cottingham 
1997 or Scheffer 1997). Less is known about deep lakes but from what is known 
now, it can be expected that the same type of model will be adequate. However, 
estimates of the parameters of this differential equation for different lakes vary 
considerably, so that a wide range of possible values has to be considered. 

By substituting x = P/m, a = L/r, b = sm/r and by changing the time scale 
to rt/m, equation (1) can be rewritten as 

$$\dot x(t) = a(t) - bx(t) + \frac{x^2(t)}{x^2(t) + 1}, \qquad x(0) = x_0. \tag{2}$$
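The rescaling can be checked line by line. The following worked derivation is a sketch of that substitution; the symbol $\tau$ for the new time variable rt/m is introduced here for illustration and does not appear in the original text.

```latex
% Worked check of the rescaling from (1) to (2).
% Assumption: \tau := rt/m names the rescaled time (notation introduced here).
\begin{align*}
P = m x,\quad \tau = \frac{rt}{m}
  &\;\Longrightarrow\;
  \frac{dP}{dt} = m\,\frac{dx}{d\tau}\,\frac{d\tau}{dt} = r\,\frac{dx}{d\tau}, \\
r\,\frac{dx}{d\tau}
  &= L - s m x + \frac{r\,m^2 x^2}{m^2 x^2 + m^2}
   = L - s m x + \frac{r\,x^2}{x^2 + 1}, \\
\frac{dx}{d\tau}
  &= \frac{L}{r} - \frac{s m}{r}\,x + \frac{x^2}{x^2 + 1}
   = a - b x + \frac{x^2}{x^2 + 1}.
\end{align*}
```

Dividing by r is what makes the feedback term parameter-free, so that b alone governs the shape of the steady-state curve.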



In order to understand some of the important features of this model, suppose that 
the loading a is constant. What happens depends on the value of the parameter 
b. If $b \geq 3\sqrt{3}/8$, all values of a lead to one stable steady-state (see Figure 1). 
If $b \leq 1/2$, values of a above the local maximum of the curve of steady-states 
in Figure 2 lead to one stable steady-state again. However, values of a below this 
local maximum yield two stable steady-states for the differential equation (2). The 
domains of attraction are determined by the unstable steady-state in between: to 
the right of this point the high stable steady-state results and to the left the low one. 
If $1/2 < b < 3\sqrt{3}/8$, values of a below the local minimum and above the local 
maximum of the curve of steady-states in Figure 3 lead to one stable steady-state. 
For values of a in between, two stable steady-states occur again for the differential 
equation (2), with domains of attraction divided by the unstable steady-state. 
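The case distinction above can be explored numerically. The sketch below is an illustration only, not part of the original analysis: it scans for steady states of equation (2) at constant loading and labels each one stable or unstable. The function names, the search interval [0, 10] and the grid size are assumptions chosen for this example, and a positive loading a is assumed.

```python
def lake_rhs(x, a, b):
    """Right-hand side of equation (2): dx/dt = a - b*x + x^2 / (x^2 + 1)."""
    return a - b * x + x * x / (x * x + 1.0)


def bisect(f, left, right, iters=80):
    """Plain bisection; assumes f changes sign (or vanishes) on [left, right]."""
    f_left = f(left)
    for _ in range(iters):
        mid = 0.5 * (left + right)
        f_mid = f(mid)
        if f_mid == 0.0:
            return mid
        if (f_mid > 0.0) == (f_left > 0.0):
            left = mid   # same sign as the left end: root lies to the right
        else:
            right = mid
    return 0.5 * (left + right)


def steady_states(a, b, x_max=10.0, steps=20000):
    """Scan [0, x_max] for steady states of equation (2) at constant loading a.

    Returns (x, is_stable) pairs. A steady state is labelled stable when
    dx/dt goes from positive to negative as x increases through it.
    The interval and grid size are illustrative assumptions.
    """
    def f(x):
        return lake_rhs(x, a, b)

    found = []
    h = x_max / steps
    lo, f_lo = 0.0, f(0.0)
    for k in range(1, steps + 1):
        hi = k * h
        f_hi = f(hi)
        # Sign change on (lo, hi]; an exact zero at hi is counted once.
        if (f_lo > 0.0 >= f_hi) or (f_lo < 0.0 <= f_hi):
            found.append((bisect(f, lo, hi), f_lo > 0.0))
        lo, f_lo = hi, f_hi
    return found


if __name__ == "__main__":
    for a, b in ((0.095, 0.6), (0.2, 0.6), (0.1, 0.7)):
        print(a, b, steady_states(a, b))
```

For b = 0.6 and a loading between the two fold values, the scan should report a stable, an unstable and a stable steady state in that order, while a single steady state remains for high loading or for b above $3\sqrt{3}/8$.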

It is easy to see a hysteresis effect now for $b < 3\sqrt{3}/8$. If the loading a is 
gradually increased, at first the steady-state level of phosphorus remains low: the 
lake remains in an oligotrophic state with a high level of ecological services. At a 
certain point, however, the lake flips to a high steady-state level of phosphorus. To 
put it differently, the lake flips to a eutrophic state with a low level of ecological 
services. If it is then decided to lower the loading a in order to try to bring the 
lake back to an oligotrophic state, it is not enough to reduce a just below that flip- 
point. If b is high enough ($1/2 < b < 3\sqrt{3}/8$, Figure 3), it can still be done, 
but a has to be reduced further until the lake flips back to an oligotrophic state. If 
$b \leq 1/2$ (Figure 2), however, then the lake is trapped in high steady-state levels 
of phosphorus which means that the first flip is irreversible. In that case, only a 
change in the parameter b (e.g., by releasing a certain type of fish and thus changing 
the fauna) can restore the lake. In the sequel of the paper, it is assumed that the 
parameter b = 0.6 so that the lake displays hysteresis but a flip to a eutrophic state 
is reversible. Furthermore, the loading a will not be exogenous anymore but subject 
to control. In section 3, a is still constant and the trade-off is considered between 
the benefits of being able to release that constant amount of phosphorus, on the one 
hand, and the resulting damage to the lake, on the other hand. Section 4 provides a 
full dynamic analysis where a can change over time. 

Figure 3. Hysteresis, reversible (b = 0.6). O: optimal management; N_1: oligotrophic Nash 
equilibrium; N_2: eutrophic Nash equilibrium; F_1: flip point; F_2: flip point. 
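The hysteresis loop for b = 0.6 can be reproduced with a simple loading sweep. The sketch below is an illustrative experiment under stated assumptions, not the authors' procedure: the loading is raised in small steps from 0.05 to 0.15 and lowered again, and at each step equation (2) is integrated by forward Euler until the state has roughly settled. The step sizes, the integration horizon and the threshold x = 0.6 used to tell the two branches apart are choices made for this example.

```python
def settle(x, a, b, dt=0.1, steps=20000):
    """Integrate equation (2) by forward Euler for steps*dt time units."""
    for _ in range(steps):
        x += dt * (a - b * x + x * x / (x * x + 1.0))
    return x


def sweep(b=0.6, a_low=0.05, a_high=0.15, n=51):
    """Raise the loading from a_low to a_high and lower it again.

    Returns two lists of (a, x) pairs: the upward and the downward pass.
    The state reached at one loading is the initial state for the next.
    """
    loads = [a_low + (a_high - a_low) * k / (n - 1) for k in range(n)]
    x = settle(0.0, loads[0], b)
    up = []
    for a in loads:
        x = settle(x, a, b)
        up.append((a, x))
    down = []
    for a in reversed(loads):
        x = settle(x, a, b)
        down.append((a, x))
    return up, down


if __name__ == "__main__":
    up, down = sweep()
    # First loading at which the lake has left the oligotrophic branch,
    # and first loading on the way down at which it has returned to it.
    up_flip = next(a for a, x in up if x > 0.6)
    down_flip = next(a for a, x in down if x < 0.6)
    print("flip to eutrophic state at loading", up_flip)
    print("flip back to oligotrophic state at loading", down_flip)
```

On this grid the upward flip should occur at a loading slightly above 0.1021 and the return flip only once the loading has fallen below roughly 0.09, which is the gap between the two flip points F_1 and F_2 in Figure 3.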



3. Economic Analysis of the Lake Steady-States 

Several interest groups operate in relation to the lake that was modelled in 
section 2. Because the release of phosphorus into the lake is due to agricultural 
activity, farmers have an interest in being able to increase the loading. In that way, 
the agricultural sector can grow without the need, for example, to invest in new 
technology in order to decrease the emission-output ratio. On the other hand, a 
clean lake is preferred by fishermen, drinking water companies, other industry that 
makes use of the water, and people who spend leisure time on or along the lake. In 
general, the lake is used as a waste sink (for example, by farmers in their activity 
as multiple non-point source polluters) and as a resource (for example, by water 
utilities and recreational users). Suppose a community or country, balancing these 
different interests, can agree on a welfare function of the form $\ln a - cx^2$, $c > 0$. 
The lake has value as a waste sink for agriculture ($\ln a$), for example, and it 
provides ecological services that decrease with the amount of phosphorus ($-cx^2$). 
The parameter c reflects the relative weight of these welfare components. Suppose, 
furthermore, that the lake is shared by n communities or countries with the same 
welfare function. In this section it is assumed that the communities choose constant 
loading levels $a_i$, $i = 1, \dots, n$, and that the amount of phosphorus adjusts 
instantaneously to its steady-state level. A logarithmic functional form for the welfare 
function is chosen, because it is convenient for the technicalities of the analysis 
and because the optimal management outcome in terms of total loading will be 
independent of the number of communities. This is helpful because the number of 
communities can be varied while the optimal management outcome as a benchmark 
remains the same. It is assumed that the area around the lake is large enough so that 
adding new communities does not lead to crowding out: the objectives are assumed 
to be additive in the number n. 

Optimal management of the lake requires solving 

$$\text{maximize } \sum_{i=1}^{n} \ln a_i - ncx^2 \quad \text{s.t.} \quad a - bx + \frac{x^2}{x^2 + 1} = 0, \quad a = \sum_{i=1}^{n} a_i. \tag{3}$$



Simple calculus shows that the optimal amount of phosphorus is determined by 

$$b - \frac{2x}{(x^2 + 1)^2} - 2cx\left(bx - \frac{x^2}{x^2 + 1}\right) = 0. \tag{4}$$



Optimal management, of course, does not necessarily yield an oligotrophic state for 
the lake. If the communities attach a relatively low weight c to ecological services, 
it can be optimal to choose a eutrophic state with a high level of agricultural activi- 
ties. It is easy to show that for large values of c, the optimal management problem 
has one maximum for an x below the flip-point. As the value of c is decreased, 
first a local maximum appears for a high x whereas the global maximum is still 
achieved for a low x, but for c low enough (i.e., c < 0.36) the global maximum 
occurs for a high x beyond the flip-point. In the sequel of the paper it is assumed 
that enough weight (i.e., c = 1) is attached to the services of the lake to make it 
optimal to aim for an oligotrophic state. 

If c = 1, equation (4) yields x* = 0.33 with total loading a* = 0.1. Note that 
the same level of total loading can also lead to the eutrophic state x = 1, if the 
initial amount of phosphorus is in the upper domain of attraction (see Figure 3). 
A flip occurs when total loading is increased to a = 0.1021, so that the lake is 
managed not far from what is called the “edge of hysteresis” (Brock, Carpenter 
and Ludwig 1997). A small mistake may cause a flip with high costs, not only 
directly because of a jump to a high x but also indirectly because of the long return 
path. 
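The numbers quoted above can be checked with a few lines of root finding. The sketch below is illustrative: it solves equation (4) for c = 1 and b = 0.6 by bisection and locates the two flip points as the stationary points of the steady-state loading curve. The bracketing intervals are assumptions chosen by inspecting the signs of the functions involved.

```python
B = 0.6  # value of b assumed in the text


def loading(x, b=B):
    """Steady-state loading a(x) = b*x - x^2/(x^2 + 1), from equation (2)."""
    return b * x - x * x / (x * x + 1.0)


def loading_slope(x, b=B):
    """Derivative of the steady-state loading: b - 2x/(x^2 + 1)^2."""
    return b - 2.0 * x / (x * x + 1.0) ** 2


def foc(x, c, b=B):
    """Left-hand side of equation (4)."""
    return loading_slope(x, b) - 2.0 * c * x * loading(x, b)


def bisect(f, left, right, iters=100):
    """Plain bisection; assumes f(left) and f(right) have opposite signs."""
    f_left = f(left)
    for _ in range(iters):
        mid = 0.5 * (left + right)
        if (f(mid) > 0.0) == (f_left > 0.0):
            left = mid
        else:
            right = mid
    return 0.5 * (left + right)


def optimal_state(c):
    """Root of equation (4) on the oligotrophic branch (bracket is assumed)."""
    return bisect(lambda x: foc(x, c), 0.1, 0.4)


def flip_points():
    """States at which the steady-state loading curve turns (a'(x) = 0)."""
    x_first = bisect(loading_slope, 0.2, 0.6)   # local maximum of a(x)
    x_second = bisect(loading_slope, 0.6, 1.5)  # local minimum of a(x)
    return x_first, x_second


if __name__ == "__main__":
    x_star = optimal_state(1.0)
    x_first, x_second = flip_points()
    print("x* =", round(x_star, 3), " a* =", round(loading(x_star), 4))
    print("loading at first flip :", round(loading(x_first), 4))
    print("loading at return flip:", round(loading(x_second), 4))
```

The gap between a* and the loading at the first flip is small, which is the sense in which the lake is managed close to the edge of hysteresis.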

If the communities do not cooperate in managing the lake, it is assumed that a Nash 
equilibrium results, which requires solving 

$$\text{maximize } \ln a_i - cx^2, \quad i = 1, \dots, n, \quad \text{s.t.} \quad a - bx + \frac{x^2}{x^2 + 1} = 0, \quad a = \sum_{i=1}^{n} a_i. \tag{5}$$


Simple calculus shows that the Nash equilibrium level of phosphorus is determined 
by 

$$b - \frac{2x}{(x^2 + 1)^2} - 2\frac{c}{n}x\left(bx - \frac{x^2}{x^2 + 1}\right) = 0. \tag{6}$$



If c = 1 again and if the number of communities n = 2, equation (6) has 
three solutions, two of which correspond to a Nash equilibrium. The first Nash 
equilibrium yields $x_{N_1} = 0.36$ with total loading $a_{N_1} = 0.1012$. The lake is 
still in an oligotrophic state but closer to the edge of hysteresis. However, the 
second Nash equilibrium yields a eutrophic state $x_{N_2} = 1.51$ with total loading 
$a_{N_2} = 0.2108$. Welfare under optimal management and in the oligotrophic Nash 
equilibrium are comparable, but welfare in the eutrophic Nash equilibrium is much 
lower. Moreover, when the communities are locked into the second Nash equili- 
brium and decide to coordinate, it is much more difficult to reach the optimal 
management outcome, due to the hysteresis. It is not enough to reduce total loading 
to 0.1. It has to be reduced to 0.0898 first, in order to flip back to an oligotrophic 
state, and can then be increased to 0.1 again. 
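The three solutions of equation (6) for n = 2 and c = 1 can be located with a sign-change scan. The sketch below is illustrative only: it lists the roots, the associated total loading and the per-community welfare $\ln(a/n) - cx^2$ at a symmetric outcome. The scan interval and grid are assumptions, and the code does not decide which roots are equilibria; the text states that the outer two are.

```python
import math

B, C = 0.6, 1.0  # values of b and c used in the text


def loading(x):
    """Steady-state total loading a(x) = b*x - x^2/(x^2 + 1)."""
    return B * x - x * x / (x * x + 1.0)


def nash_condition(x, n):
    """Left-hand side of equation (6) for n communities."""
    slope = B - 2.0 * x / (x * x + 1.0) ** 2
    return slope - 2.0 * (C / n) * x * loading(x)


def bisect(f, left, right, iters=80):
    """Plain bisection; assumes f changes sign (or vanishes) on [left, right]."""
    f_left = f(left)
    for _ in range(iters):
        mid = 0.5 * (left + right)
        f_mid = f(mid)
        if f_mid == 0.0:
            return mid
        if (f_mid > 0.0) == (f_left > 0.0):
            left = mid
        else:
            right = mid
    return 0.5 * (left + right)


def solutions(n, x_min=0.05, x_max=3.0, steps=5900):
    """All roots of equation (6) on [x_min, x_max] (interval is assumed)."""
    def f(x):
        return nash_condition(x, n)

    roots = []
    h = (x_max - x_min) / steps
    lo, f_lo = x_min, f(x_min)
    for k in range(1, steps + 1):
        hi = x_min + k * h
        f_hi = f(hi)
        # Sign change on (lo, hi]; an exact zero at hi is counted once.
        if (f_lo > 0.0 >= f_hi) or (f_lo < 0.0 <= f_hi):
            roots.append(bisect(f, lo, hi))
        lo, f_lo = hi, f_hi
    return roots


def welfare(x, n):
    """Per-community welfare ln(a/n) - c*x^2 at a symmetric steady state."""
    return math.log(loading(x) / n) - C * x * x


if __name__ == "__main__":
    for x in solutions(2):
        print(round(x, 3), round(loading(x), 4), round(welfare(x, 2), 3))
```

Comparing the welfare values at the first and last root illustrates the statement that the eutrophic equilibrium is much worse for both communities than the oligotrophic one.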

If n > 2, these numbers change of course, but it is easy to see that for all b in the 
range with hysteresis and reversibility ($1/2 < b < 3\sqrt{3}/8$), on which this paper 
focuses, two Nash equilibria always occur. In fact, equation (6) intersects the curve 
for the lake steady-states with the curve described by $\frac{n}{2cx}\left(b - \frac{2x}{(x^2+1)^2}\right)$. 
For b in the range given above, this curve has a negative part for x in a positive 
range. Furthermore, it approaches infinity for $x \downarrow 0$ and it approaches zero from 
above for $x \to \infty$. Increasing the number of communities n implies that the curve 
is stretched out but the three intersection points remain, two of which are Nash 
equilibria. 

In the next section the loading a can change over time and the amount of 
phosphorus does not adjust instantaneously to its steady-state level but gradually 
according to equation (2), which turns the optimal management problem into 
an optimal control problem and the static game into a differential game. The 
Nash solutions found in this section return (approximately) as saddle-point stable 
steady-states with solution trajectories that may have to bend around the flip-point. 

4. Dynamic Economic Analysis of the Lake Model 

Suppose that the problem has an infinite horizon, so that the objectives become 

$$W_i = \int_0^\infty e^{-\rho t}\left[\ln a_i(t) - cx^2(t)\right]dt, \qquad i = 1, \dots, n, \tag{7}$$

where $\rho > 0$ is the discount rate. 

4.1. OPTIMAL MANAGEMENT 

Optimal management requires maximizing the sum of the objectives $W_i$, subject to 
equation (2) with $a = \sum_{i=1}^{n} a_i$. This is an optimal control problem and the maximum 
principle yields the necessary conditions 

$$\frac{1}{a_i(t)} + \lambda(t) = 0, \qquad i = 1, \dots, n, \tag{8}$$

$$\dot\lambda(t) = \left[(b + \rho) - \frac{2x(t)}{(x^2(t) + 1)^2}\right]\lambda(t) + 2ncx(t), \tag{9}$$

with a transversality condition on the co-state $\lambda$, and equation (2). Using (8), 
equation (9) can be rewritten as a set of identical differential equations in $a_i$, 
$i = 1, \dots, n$. The sum of these equations (or multiplication of one of them by n) yields 
a differential equation in total loading a: 

$$\dot a(t) = -\left[(b + \rho) - \frac{2x(t)}{(x^2(t) + 1)^2}\right]a(t) + 2cx(t)a^2(t). \tag{10}$$
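The step from the co-state equation to the equation in total loading can be written out explicitly. The following is a sketch of that calculation under the symmetry assumption $a_i = a/n$; the shorthand $\phi(x)$ for the bracketed term is introduced here for readability and is not notation from the original.

```latex
% Shorthand (an assumption of this sketch): \phi(x) := (b+\rho) - 2x/(x^2+1)^2.
\begin{align*}
\lambda = -\frac{1}{a_i}
  &\;\Longrightarrow\; \dot\lambda = \frac{\dot a_i}{a_i^{2}}, \\
\frac{\dot a_i}{a_i^{2}} = -\frac{\phi(x)}{a_i} + 2ncx
  &\;\Longrightarrow\; \dot a_i = -\phi(x)\,a_i + 2ncx\,a_i^{2}, \\
a_i = \frac{a}{n},\quad \dot a = n\,\dot a_i
  &\;\Longrightarrow\; \dot a = -\phi(x)\,a + 2ncx\cdot n\cdot\frac{a^{2}}{n^{2}}
   = -\phi(x)\,a + 2cx\,a^{2}.
\end{align*}
```

The factor n cancels in the last line, which is why the dynamics of total loading under optimal management do not depend on the number of communities.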

The solution is given by the set of differential equations (2) and (10), and the 
transversality condition. Note that b = 0.6 (see section 2) and c = 1 (see section 3). 
The phase-diagram in the (x, a)-plane is drawn in Figure 4a. One curve represents 
the steady-states for x and can be recognized as the lake steady-states, which were 
discussed in sections 2 and 3. The other curve represents the steady-states for a. 
Its position depends on the discount rate $\rho$. If the discount rate is low enough 
($\rho < 0.1$), this curve intersects the first curve only once in a point that is saddle- 
point stable. If the discount rate is higher, the second curve moves up, it intersects 
the first curve three times, and the analysis becomes similar to the analysis of the 
open-loop Nash equilibrium below. It is assumed here that the discount rate is 
$\rho = 0.03$, which yields the graph in Figure 4a. The steady-state is close to the static 
optimal-management solution in section 3, and converges to that point when the 
discount rate goes to 0. The optimal solution prescribes a jump, at any initial state 
of the lake, to the stable manifold, followed by movement towards the steady-state. 
Given the non-linearity of the problem, it is not easy to obtain an analytical expression 
for the stable manifold but a numerical approximation is not difficult to develop. 
Starting at the steady-state point, the characteristic vector corresponding to the 
negative eigenvalue of the Jacobian matrix determines the direction of the stable 
manifold. Working backwards from the steady-state in small steps, a piecewise 
linear approximation of the stable manifold is then found and the approximation 
gets better the smaller the steps. With Mathematica (Wolfram 1999), the stable and 
unstable manifolds for the set of differential equations (2) and (10) can be drawn 
(see Figure 4b). Note that the stable manifold can be reached from all initial states 
$x_0$ and bends around the lower flip-point (see also section 3). 
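The backward construction of the stable manifold described above can be sketched in a few dozen lines. This is an illustration rather than the authors' Mathematica code: it uses a classical fourth-order Runge-Kutta step in reversed time instead of piecewise linear steps, and the offset eps, the step size dt and the stopping bounds are assumptions chosen for this example.

```python
import math

B, C, RHO = 0.6, 1.0, 0.03  # parameter values quoted in the text


def field(x, a):
    """Right-hand sides of equations (2) and (10)."""
    g = 2.0 * x / (x * x + 1.0) ** 2
    dx = a - B * x + x * x / (x * x + 1.0)
    da = -((B + RHO) - g) * a + 2.0 * C * x * a * a
    return dx, da


def steady_state():
    """Bisection on da/dt = 0 evaluated along the dx/dt = 0 curve."""
    def h(x):
        a = B * x - x * x / (x * x + 1.0)
        return (B + RHO) - 2.0 * x / (x * x + 1.0) ** 2 - 2.0 * C * x * a

    lo, hi = 0.1, 0.45  # bracket chosen by inspection: h(lo) > 0 > h(hi)
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if h(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    x = 0.5 * (lo + hi)
    return x, B * x - x * x / (x * x + 1.0)


def stable_direction(x, a):
    """Eigenvalues of the Jacobian and the slope da/dx of the stable eigenvector."""
    g = 2.0 * x / (x * x + 1.0) ** 2
    dg = 2.0 * (1.0 - 3.0 * x * x) / (x * x + 1.0) ** 3
    j11, j12 = -B + g, 1.0
    j21 = a * dg + 2.0 * C * a * a
    j22 = -((B + RHO) - g) + 4.0 * C * x * a
    tr, det = j11 + j22, j11 * j22 - j12 * j21
    root = math.sqrt(tr * tr - 4.0 * det)  # real because det < 0 at a saddle
    lam_neg, lam_pos = 0.5 * (tr - root), 0.5 * (tr + root)
    return lam_neg, lam_pos, lam_neg - j11  # eigenvector is (1, lam_neg - j11)


def rk4_step(x, a, dt):
    """One classical Runge-Kutta step; a negative dt runs time backwards."""
    k1 = field(x, a)
    k2 = field(x + 0.5 * dt * k1[0], a + 0.5 * dt * k1[1])
    k3 = field(x + 0.5 * dt * k2[0], a + 0.5 * dt * k2[1])
    k4 = field(x + dt * k3[0], a + dt * k3[1])
    return (x + dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
            a + dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)


def manifold_branch(direction, eps=1e-4, dt=0.02, x_max=2.0, max_steps=100000):
    """Trace one branch of the stable manifold backwards from the steady state.

    direction = +1 gives the branch with x above the steady state,
    direction = -1 the branch below it.
    """
    x_s, a_s = steady_state()
    _, _, slope = stable_direction(x_s, a_s)
    x, a = x_s + direction * eps, a_s + direction * eps * slope
    points = [(x, a)]
    for _ in range(max_steps):
        x, a = rk4_step(x, a, -dt)
        if not (0.0 < x < x_max) or a <= 0.0:
            break
        points.append((x, a))
    return points


if __name__ == "__main__":
    print("steady state:", steady_state())
    right = manifold_branch(+1)
    print("right branch ends at", right[-1])
```

Each returned point (x, a) gives the loading that the approximate manifold assigns to state x, so plotting the two branches together should resemble the stable manifold in Figure 4b.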



4.2. OPEN-LOOP NASH EQUILIBRIUM 

The open-loop Nash equilibrium (Başar and Olsder 1982) is found by applying the 
maximum principle to each objective $W_i$, $i = 1, \dots, n$, separately (fixing $a_j$ for 
$j \neq i$) subject to equation (2) with $a = \sum_{i=1}^{n} a_i$. The set of necessary conditions 
becomes 

$$\frac{1}{a_i(t)} + \lambda_i(t) = 0, \qquad i = 1, \dots, n, \tag{11}$$

$$\dot\lambda_i(t) = \left[(b + \rho) - \frac{2x(t)}{(x^2(t) + 1)^2}\right]\lambda_i(t) + 2cx(t), \qquad i = 1, \dots, n, \tag{12}$$

with transversality conditions on the co-states $\lambda_i$, and equation (2). Using (11), 
equations (12) can be rewritten as differential equations in $a_i$, $i = 1, \dots, n$. These 
equations are identical and the sum (or multiplication of one of them by n) yields 
a differential equation in total loading a: 

$$\dot a(t) = -\left[(b + \rho) - \frac{2x(t)}{(x^2(t) + 1)^2}\right]a(t) + 2\frac{c}{n}x(t)a^2(t). \tag{13}$$

The open-loop Nash equilibrium is given by the set of differential equations (2) and 
(13), and the transversality conditions. The phase-diagram for two communities 
n = 2 (and b = 0.6, c = 1, $\rho = 0.03$) in the (x, a)-plane is drawn in 
Figure 5a. The steady-state curves for x and a now have three intersection points. 
The intersection points on the left and on the right are saddle-point stable and 
yield possible steady-states for the Nash equilibrium in an oligotrophic and in 
a eutrophic area, respectively. The intersection point in the middle is unstable 
with complex eigenvalues. Again with Mathematica (Wolfram 1999), the stable 
and unstable manifolds for the set of differential equations (2) and (13) can be 
drawn (see Figure 5b). The trajectories of the stable manifold curl a while from the 
intersection point in the middle and then go either to the steady-state on the left or 
to the steady-state on the right. It is clear that when the initial state $x_0$ lies to the 
right of the set of curls, the open-loop Nash equilibrium follows the upper trajectory 
to the steady-state on the right, and when the initial state lies to the left of that area, 
it follows the lower trajectory to the steady-state on the left. However, it is more 
difficult to see what happens in the range in between. It can be shown (Appendix A) 
that a state $x_s$ exists such that for $x_0 < x_s$, the open-loop Nash equilibrium jumps 
to the lower trajectory and moves towards the oligotrophic steady-state whereas for 
$x_0 > x_s$, it jumps to the upper trajectory and moves towards the eutrophic steady- 
state. The point $x_s$ is called a Skiba point because it was introduced by Skiba in 
an optimal growth model with a convex-concave production function (Skiba 1978; 
Brock and Malliaris 1989). 
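The local character of the three intersection points can be checked from the Jacobian of equations (2) and (13). The sketch below is an illustration under stated assumptions: it finds the intersections for a given n by a sign-change scan on an assumed interval and classifies each one from the trace and determinant of the Jacobian. The labels ("saddle", "unstable focus", and so on) are this example's own naming.

```python
B, C, RHO = 0.6, 1.0, 0.03  # parameter values quoted in the text


def loading(x):
    """Lake steady-state curve a(x) = b*x - x^2/(x^2 + 1)."""
    return B * x - x * x / (x * x + 1.0)


def intersection_condition(x, n):
    """Zero where the da/dt = 0 curve of (13) meets the dx/dt = 0 curve of (2)."""
    g = 2.0 * x / (x * x + 1.0) ** 2
    return (B + RHO) - g - 2.0 * (C / n) * x * loading(x)


def bisect(f, left, right, iters=80):
    """Plain bisection; assumes f changes sign (or vanishes) on [left, right]."""
    f_left = f(left)
    for _ in range(iters):
        mid = 0.5 * (left + right)
        f_mid = f(mid)
        if f_mid == 0.0:
            return mid
        if (f_mid > 0.0) == (f_left > 0.0):
            left = mid
        else:
            right = mid
    return 0.5 * (left + right)


def steady_states(n, x_min=0.05, x_max=3.0, steps=5900):
    """States x of all interior steady states on an assumed search interval."""
    def f(x):
        return intersection_condition(x, n)

    roots = []
    h = (x_max - x_min) / steps
    lo, f_lo = x_min, f(x_min)
    for k in range(1, steps + 1):
        hi = x_min + k * h
        f_hi = f(hi)
        if (f_lo > 0.0 >= f_hi) or (f_lo < 0.0 <= f_hi):
            roots.append(bisect(f, lo, hi))
        lo, f_lo = hi, f_hi
    return roots


def classify(x, n):
    """Classify a steady state from the trace and determinant of the Jacobian."""
    a = loading(x)
    g = 2.0 * x / (x * x + 1.0) ** 2
    dg = 2.0 * (1.0 - 3.0 * x * x) / (x * x + 1.0) ** 3
    j11, j12 = -B + g, 1.0
    j21 = a * dg + 2.0 * (C / n) * a * a
    j22 = -((B + RHO) - g) + 4.0 * (C / n) * x * a
    tr, det = j11 + j22, j11 * j22 - j12 * j21
    if det < 0.0:
        return "saddle"
    if tr * tr - 4.0 * det < 0.0:
        return "unstable focus" if tr > 0.0 else "stable focus"
    return "unstable node" if tr > 0.0 else "stable node"


if __name__ == "__main__":
    for n in (1, 2):
        print(n, [(round(x, 3), classify(x, n)) for x in steady_states(n)])
```

Setting n = 1 reproduces the optimal management system of section 4.1, so the same routine can be used to confirm that only one saddle-point stable intersection remains in that case for the discount rate chosen in the text.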

If n > 2, the same arguments as in section 3 can be used to show that two 
open-loop Nash equilibria always occur. Note, however, by inspection of equation 
(13), that the arguments do not hold for all b in the range $(1/2, 3\sqrt{3}/8)$ anymore, 
because of the positive discount rate $\rho$, but only for $b + \rho < 3\sqrt{3}/8$, which holds 
for the specific values chosen for b and $\rho$. 

Before fuming to the question whether a tax can induce the communities to 
choose loadings according to the optimal management trajectory, it is useful to 
make a few general remarks on the analysis above. First, when comparing equa- 
tions (10) and (13), it is immediately clear that the open-loop Nash equilibrium also 
results under optimal management with parameter c/n instead of parameter c. It is 
an example of a potential game where Nash equilibria can be found by maximizing 
some adapted objective (Monderer and Shapley 1996; Dechert and Brock 1999). 
Second, it also means that all outcomes considered here (optimal management with 
varying relative weight c, and symmetric open-loop Nash equilibria with fixed c 
but varying number of communities n) can be traced by solving an optimal control
problem where the set of differential equations, characterizing the solution, has 
a parameter c/n. This parameter can be denoted as the bifurcation parameter. 
It can be shown that in this Hamiltonian system with positive discounting only 
saddle-node and heteroclinic bifurcations can occur (Wagener 1999; Brock and 
Starrett 1999). Third, Figure 5b contains dynamics similar to those found in a model
of external economies with multiple steady-states (Krugman 1991; Matsuyama
1991). Krugman (1991) argues that for initial conditions (history) to the left and 
to the right of the set of spirals, the economy moves to the left or to the right, 
respectively, but that for history within the set of spirals, expectations determine 
where the economy will end up. This is not an answer to uncertainty but since for 
these initial conditions either a path can be chosen that goes to the left or one that 
goes to the right, something must determine where the economy will end up. In 
our model, however, the solution is driven by an objective. A full analysis of the 
value function shows that the initial conditions determine the outcome, so that only 
history matters. A Skiba point exists that divides the area of initial conditions into 
an area that is attracted by the steady-state on the left and an area that is attracted 
by the steady-state on the right. 
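
The role of c/n as a bifurcation parameter can be illustrated by counting steady states for different values of it. The sketch below assumes the same forms of equations (2) and (10)/(13) as used elsewhere in this section (with c replaced by c/n); the function name, scan range and grid size are illustrative and not taken from the paper.

```python
def count_steady_states(c_over_n, b=0.6, rho=0.03,
                        x_min=0.05, x_max=3.0, steps=2000):
    """Count intersections of the curves dx/dt = 0 and da/dt = 0.

    c_over_n is the bifurcation parameter c/n; b and rho default to the
    values used in the text.
    """
    def gap(x):
        lake = b * x - x * x / (x * x + 1.0)                  # dx/dt = 0
        costate = ((b + rho) - 2.0 * x / (x * x + 1.0) ** 2) / (2.0 * c_over_n * x)
        return lake - costate                                  # zero at a steady state

    count = 0
    prev = gap(x_min)
    for k in range(1, steps + 1):
        cur = gap(x_min + (x_max - x_min) * k / steps)
        if prev * cur < 0:
            count += 1
        if cur != 0:
            prev = cur
    return count


if __name__ == "__main__":
    print(count_steady_states(1.0))   # optimal management with c = 1
    print(count_steady_states(0.5))   # open-loop Nash equilibrium, c = 1, n = 2
```

For c/n = 1 the scan finds a single steady state and for c/n = 0.5 it finds three, so a bifurcation occurs somewhere between these two values of the parameter.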

4.3. TAXES 

Consider the case of achieving the unique steady-state amount of phosphorus under
optimal management by a tax $\tau$ on phosphorus loading. Under the tax scheme the
objectives (7) of the open-loop differential game change to

$$W_i = \int_0^\infty e^{-\rho t}\left[\ln a_i(t) - \tau(t)\,a_i(t) - c\,x^2(t)\right]dt, \quad i = 1, \ldots, n. \tag{14}$$

The maximum principle requires for the optimal choice of phosphorus loading at 
each point in time that 

$$\frac{1}{a_i(t)} - \tau(t) + \lambda_i(t) = 0, \quad i = 1, \ldots, n. \tag{15}$$

In order to obtain the loading that corresponds to optimal management, it is
immediately clear by comparing (8) to (15) that the tax on loading should be chosen such
that $\tau(t) = -\lambda(t) + \lambda_i(t)$. This implies that the tax bridges the gap between the
social shadow cost of the accumulated phosphorus $\lambda(t)$ and the private shadow cost
of the accumulated phosphorus $\lambda_i(t)$ that causes the steady-state phosphorus levels
in the open-loop Nash equilibrium to exceed the (unique) steady-state phosphorus 
level under optimal management. The tax rate, however, is time-dependent, since 
it has to equalize cooperative and non-cooperative loading at every point in time. 
Although optimal, such a tax will be very difficult to implement, since it would 
require a regulating institution to continuously change the tax rate. Another, more 
realistic, approach would be to choose a fixed tax rate on loading, defined such that 
the non-cooperative steady-state phosphorus level under the constant tax equals the
steady-state phosphorus level under optimal management. This tax will be called 
the optimal steady-state tax (OSST). 

By comparing (9) to (16) and using (10), it is easy to see that the OSST $\tau^*$ is
given by

$$\tau^* = -\frac{(n-1)\lambda^*}{n} = \frac{n-1}{a^*}, \tag{16}$$

where $\lambda^*$ is the value of the co-state and $a^*$ is total loading in the steady-state
under optimal management. Under this constant tax scheme, the open-loop Nash
equilibrium will be given by the set of differential equations (2) and, instead of
(13),

$$\dot a(t) = -\left[(b+\rho) - \frac{2x(t)}{(x^2(t)+1)^2}\right]\left[a(t) - \frac{\tau^*}{n}\,a^2(t)\right] + \frac{2c}{n}\,x(t)\,a^2(t) \tag{17}$$

with a transversality condition.

It is easy to check that the steady-state $(x^*, a^*)$ for the set of differential
equations (2) and (10) under optimal management is also a steady-state for the set of
differential equations (2) and (17) in the open-loop Nash equilibrium under the
constant tax $\tau^*$. By substituting $a(t) = a^*$ and $\tau^* = (n - 1)/a^*$ in the second
term between brackets of the right-hand side of equation (17), this term reduces to
$a^*/n$. It is then easy to see that $(x^*, a^*)$ is also a point on the curve representing the
steady-states for total loading a in the open-loop Nash equilibrium under the OSST.
However, the rest of this curve differs from the one under optimal management. 
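
The steady-state check can be reproduced numerically. The sketch below assumes that equation (10) has the steady-state condition [(b + rho) - 2x/(x^2 + 1)^2] = 2cxa and that equation (17) reads da/dt = -[(b + rho) - 2x/(x^2 + 1)^2][a - (tau*/n) a^2] + (2c/n) x a^2. The function names and the bisection bracket [0.3, 0.4] are illustrative assumptions that suit the parameter values of the text.

```python
B_LAKE, C, RHO = 0.6, 1.0, 0.03   # b, c, rho as quoted in the text


def bracket(x):
    """(b + rho) - 2x / (x^2 + 1)^2."""
    return (B_LAKE + RHO) - 2.0 * x / (x * x + 1.0) ** 2


def lake_curve(x):
    """Total loading that keeps the phosphorus stock constant (dx/dt = 0)."""
    return B_LAKE * x - x * x / (x * x + 1.0)


def optimal_steady_state(lo=0.3, hi=0.4, tol=1e-13):
    """Steady state (x*, a*) of (2) and (10), found by bisection.

    The bracket [lo, hi] is an assumption; the gap is negative at lo and
    positive at hi for the parameter values above.
    """
    def gap(x):
        return lake_curve(x) - bracket(x) / (2.0 * C * x)

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0:
            lo = mid
        else:
            hi = mid
    x_star = 0.5 * (lo + hi)
    return x_star, lake_curve(x_star)


def a_dot_osst(x, a, n, tau):
    """Right-hand side of equation (17) under a constant tax tau."""
    return -bracket(x) * (a - tau * a * a / n) + (2.0 * C / n) * x * a * a


if __name__ == "__main__":
    x_star, a_star = optimal_steady_state()
    for n in (2, 10, 100):
        tau_star = (n - 1) / a_star          # equation (16)
        print(n, tau_star, a_dot_osst(x_star, a_star, n, tau_star))
```

For each n the last printed number is zero up to rounding error, which confirms that $(x^*, a^*)$ is a rest point of (17) under the tax (16). The sketch says nothing about the other intersections of the two curves.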

It should be made clear that the OSST leads to the optimal management steady- 
state but the path under the OSST that determines the transition to the steady- 
state is not the same as the optimal management path. Coincidence of the optimal 
management path and the regulated path requires the time-dependent tax. To
put it differently, the stable manifold of the optimal management problem is not the 
same as the stable manifold of the regulated problem, although both approach the 
same saddle-point. Note also that the time of convergence to the steady-state under 
the OSST will be different than under a time-dependent tax or another control 
scheme. 

If the number of communities n = 2, the phase-diagram under the OSST in
the (x, a)-plane is drawn in Figure 6a. Although this figure differs from Figure 4a
for optimal management, it is qualitatively the same. It has one saddle-point and
a corresponding stable manifold. Starting at either of the unregulated Nash equilibrium
steady-states, the two communities will change their loadings under the OSST and
the equilibrium path will follow this stable manifold and move towards the optimal 
management steady-state. Starting at the oligotrophic Nash equilibrium, this is a 
short trajectory, but starting at the eutrophic Nash equilibrium, the path has to bend 
around the flip-point. 

Figure 6c. Phase diagram, optimal steady-state tax (n = 100).



As the number of communities n increases, at a certain point the phase-diagram
under the OSST becomes very complicated. From equation (17), the curve
representing the steady-states for a is given by

$$a = \frac{(b+\rho) - \dfrac{2x}{(x^2+1)^2}}{\dfrac{2cx}{n} + \left[(b+\rho) - \dfrac{2x}{(x^2+1)^2}\right]\dfrac{n-1}{n a^*}}. \tag{18}$$



The denominator of the right-hand side of equation (18) is zero if and only if

$$\frac{2cx}{n-1} + \left[(b+\rho) - \frac{2x}{(x^2+1)^2}\right]\frac{1}{a^*} = 0. \tag{19}$$



For n = 2 the left-hand side of equation (19) is positive, but for n > 7
equation (19) has two roots which yield two vertical asymptotes for the curve given
by equation (18). Note that this phenomenon occurs because the term between
brackets is partly negative which is caused by the specific choice of b and $\rho$ (see
also section 4.2). Moreover, if n goes to infinity, the curve approaches $a = a^*$, but
this convergence is not uniform, due to the two discontinuities. An example of such 
a phase-diagram under the OSST is drawn in Figure 6b where n = 10. This case 
still has only one saddle-point stable steady-state. If n gets large, it is to be expected 
that the curve has more intersection points with the curve representing the lake 
steady-states, because of the convergence to a*. This implies the possibility that 
multiple steady-states occur under the OSST. An example is drawn in Figure 6c 
where n = 100. This case has three steady-states again, two of which are saddle- 
point stable, whereas the middle one is unstable. For a lower discount rate $\rho$ and a
higher number of communities n, it may happen that two more steady-states occur
between the asymptotes, one unstable and one saddle-point. The existence of a 
second steady-state characterized by saddle-point stability leads to the conclusion 
that if the number of communities n is high, the optimal steady-state tax may not 
work. Depending on the initial conditions, the OSST may direct the equilibrium 
path towards a steady-state with a higher phosphorus level than in the optimal 
management steady-state. 
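
The appearance of the two vertical asymptotes can be checked by counting the roots of equation (19). The sketch below assumes (19) reads 2cx/(n - 1) + [(b + rho) - 2x/(x^2 + 1)^2]/a* = 0, with a* the optimal-management steady-state loading computed from (2) and (10). The function names, the bisection bracket and the scan grid are illustrative assumptions.

```python
B_LAKE, C, RHO = 0.6, 1.0, 0.03   # b, c, rho as quoted in the text


def bracket(x):
    """(b + rho) - 2x / (x^2 + 1)^2."""
    return (B_LAKE + RHO) - 2.0 * x / (x * x + 1.0) ** 2


def optimal_loading(lo=0.3, hi=0.4, tol=1e-13):
    """Steady-state loading a* under optimal management (bisection on x)."""
    def lake(x):
        return B_LAKE * x - x * x / (x * x + 1.0)

    def gap(x):
        return lake(x) - bracket(x) / (2.0 * C * x)

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0:
            lo = mid
        else:
            hi = mid
    return lake(0.5 * (lo + hi))


def count_asymptotes(n, a_star, x_min=0.001, x_max=3.0, steps=3000):
    """Number of sign changes of the left-hand side of (19) on a grid (n >= 2)."""
    def lhs(x):
        return 2.0 * C * x / (n - 1) + bracket(x) / a_star

    count, prev = 0, lhs(x_min)
    for k in range(1, steps + 1):
        cur = lhs(x_min + (x_max - x_min) * k / steps)
        if prev * cur < 0:
            count += 1
        if cur != 0:
            prev = cur
    return count


if __name__ == "__main__":
    a_star = optimal_loading()
    for n in (2, 10, 100):
        print(n, count_asymptotes(n, a_star))
```

With these parameter values the count is zero for n = 2 and two for n = 10 and n = 100, matching the cases drawn in Figures 6a to 6c. Values of n close to the switch are numerically delicate and are deliberately left out of the example.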



4.4. FEEDBACK NASH EQUILIBRIUM 

The open-loop Nash equilibrium is weakly time-consistent but not strongly time- 
consistent which implies that the equilibrium is not robust against unexpected 
changes in the state of the lake (Başar 1989). To obtain an equilibrium with the
Markov perfect property, the feedback Nash equilibrium has to be found which 
means that the Hamilton-Jacobi-Bellman equation for the game has to be solved. 
This is a difficult problem because it is not clear what the value function will look 
like due to the complexity of the lake model. 

In a linear-quadratic framework with quadratic value functions, the solution 
would be analytically tractable. In a problem like the one under consideration, 
one would expect that the steady-state amount of phosphorus and total loadings 
will be higher in the feedback Nash equilibrium than in the open-loop Nash
equilibrium. This would confirm the intuition derived in similar types of problems (van
der Ploeg and de Zeeuw 1992). If a community knows that the other communities
will respond to a higher amount of phosphorus in the lake with lower loadings, it 
loads more at the margin because loading will be partly offset by the reactions of 
the other communities. Since all communities argue in this way, total loading in 
the feedback equilibrium is higher than in case the loadings are not conditioned 
on the state of the lake as in the open-loop Nash equilibrium. However, Tsutsui 
and Mino (1990) have shown, for a dynamic duopoly model with sticky prices, 
that non-quadratic value functions exist that solve the Hamilton-Jacobi-Bellman 
equation. From this it follows that multiple feedback Nash equilibria exist. The 
resulting set of steady-state prices lies near to the price under full cooperation or 
collusion. It implies that feedback Nash equilibria exist with steady-state prices that 
are better for the duopoly than the steady-state price in the open-loop Nash equili- 
brium. This technique was applied to the international pollution control model by 
Dockner and Long (1993) to show that (non-linear) feedback equilibria exist that 
yield lower steady-state stocks of pollution than the open-loop equilibrium. Since 
the characteristics of those problems (infinite horizon and a one-dimensional state) 
are the same as for the lake problem, one may expect that the same technique can 
be applied here. In this section a few steps will be taken but a full analysis is left 
for further research. 

Suppose that the loading strategies in the (symmetric) feedback Nash
equilibrium are given by $a_i = h(x)$, $i = 1, \ldots, n$. The Hamilton-Jacobi-Bellman
equation (or dynamic programming equation) for community i, $i = 1, \ldots, n$,
becomes

$$\rho V(x) = \max_{a_i}\left[\ln a_i - c x^2 + V_x(x)\left[a_i + (n-1)h(x) - bx + \frac{x^2}{x^2+1}\right]\right], \tag{20}$$



where V denotes the value function, which is the same for each community.
The first-order condition yields

$$\frac{1}{a_i} = -V_x(x) \;\Longrightarrow\; a_i := h(x) = \frac{-1}{V_x(x)}. \tag{21}$$

Substitution of (21) into (20) leads to

$$\rho V(x) = \ln h(x) - c x^2 - \frac{1}{h(x)}\left[n h(x) - bx + \frac{x^2}{x^2+1}\right], \tag{22}$$

and differentiation of (22) with respect to x, using (21) again, leads to

$$-\frac{\rho}{h(x)} = \frac{h'(x)}{h(x)} - 2cx + \frac{h'(x)}{h^2(x)}\left[-bx + \frac{x^2}{x^2+1}\right] + \frac{1}{h(x)}\left[b - \frac{2x}{(x^2+1)^2}\right]. \tag{23}$$



Rewriting (23) yields an ordinary differential equation in the feedback loading
h(x):

$$h'(x)\left[h(x) - bx + \frac{x^2}{x^2+1}\right] + h(x)\left[(\rho+b) - 2cx\,h(x) - \frac{2x}{(x^2+1)^2}\right] = 0. \tag{24}$$



If the resulting steady-state $x^F$ were known, equation (2) with total loading a =
nh(x) gives a boundary condition for the differential equation (24):

$$h(x^F) = \frac{1}{n}\left[b\,x^F - \frac{(x^F)^2}{(x^F)^2+1}\right], \tag{25}$$

so that the differential equation can be solved. However, this steady-state is a
degree of freedom. This means that it is to be expected that multiple feedback 
Nash equilibria exist. If that steady-state is chosen to be equal to the steady-state 
under optimal management, it may yield a feedback Nash equilibrium that sustains 
the optimal management outcome in steady-state. Note that this does not imply 
that the same trajectory to the steady-state is followed: welfare will generally still 
be different. However, it is to be expected that a feedback Nash equilibrium exists 
that is better than the open-loop one. How can this result be reconciled with the 
intuition described above and the conclusion that the feedback Nash equilibrium 
is worse than the open-loop one? Note that the analysis above was restricted to 
quadratic value functions in a linear-quadratic framework so that linear controls 
result. It seems that by enlarging the strategy spaces to non-linear controls (with 
non-quadratic value functions), equilibria arise that are better. This may be recog- 
nized as a type of folk-theorem in differential games. Further research is needed to 
be able to give full answers to these issues. 
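
To see how equations (24) and (25) pin down a candidate feedback strategy once a steady-state is picked, the differential equation can be integrated numerically away from the boundary point. The sketch below is not the analysis left for further research; it solves (24) for h'(x) and applies a standard fourth-order Runge-Kutta scheme. The candidate steady-state 0.35, the integration interval, the step count and all function names are hypothetical choices made for illustration.

```python
B_LAKE, C, RHO = 0.6, 1.0, 0.03   # b, c, rho as quoted in the text


def recycling(x):
    """Internal loading term x^2 / (x^2 + 1) of the lake equation."""
    return x * x / (x * x + 1.0)


def h_prime(x, h):
    """h'(x) obtained by solving equation (24) for the derivative."""
    denom = h - B_LAKE * x + recycling(x)
    numer = h * ((RHO + B_LAKE) - 2.0 * C * x * h - 2.0 * x / (x * x + 1.0) ** 2)
    return -numer / denom


def boundary_loading(x_f, n):
    """Boundary condition (25): individual loading at the chosen steady state."""
    return (B_LAKE * x_f - recycling(x_f)) / n


def integrate(x_f, n, x_end, steps=200):
    """Runge-Kutta integration of (24) from x_f to x_end; returns (x, h) pairs."""
    dx = (x_end - x_f) / steps
    x, h = x_f, boundary_loading(x_f, n)
    path = [(x, h)]
    for _ in range(steps):
        k1 = h_prime(x, h)
        k2 = h_prime(x + dx / 2.0, h + dx * k1 / 2.0)
        k3 = h_prime(x + dx / 2.0, h + dx * k2 / 2.0)
        k4 = h_prime(x + dx, h + dx * k3)
        h += dx * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        x += dx
        path.append((x, h))
    return path


if __name__ == "__main__":
    # Hypothetical candidate steady state 0.35 with n = 2 communities.
    for x, h in integrate(0.35, 2, 0.40)[::50]:
        print(round(x, 4), round(h, 5))
```

A different choice of the steady-state gives a different boundary value and hence a different solution of (24), which is the degree of freedom referred to in the text. The sketch only integrates over a short interval where the coefficient of h'(x) stays away from zero; it does not verify that the resulting h(x) is an equilibrium strategy.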
5. Conclusion

Economics of ecological systems is a much neglected area in the literature. Further- 
more, the complex dynamics of these systems and the common property aspect of 
the ecological services as resource and waste sink, present interesting challenges 
to economic theory. This paper focuses on the shallow lake, as an example but also
because much is known about shallow lakes in the ecological literature. However, 
the analysis in this paper applies to all models that are driven by convex-concave 
relations, and models with this feature are very typical for mathematical models in 
ecology (see Murray 1989). 

Internal loading of phosphorus in shallow lakes causes the lake model to be 
non-linear, with hysteresis effects in the more interesting cases. As a consequence, 
even if optimal management of the lake has only one steady-state with saddle- 
point stability, either an increase in the discount rate or an increase in the number 
of communities, sharing the lake, leads to more saddle-points and complicated 
dynamics in between. However, a Skiba point exists, which means that in these 
cases the initial level of accumulated phosphorus determines whether the lake will 
end up in a clear or a turbid state. For a small number of communities, a constant tax 
on the loading of phosphorus can induce optimal behaviour and a return to a clear 
state, but for a large number this policy may not work. The analysis employs the 
open-loop Nash equilibrium to characterize non-cooperative behaviour. The
feedback Nash equilibrium would be more appropriate but is very difficult to identify
for this type of problem. Some first steps are taken but a full analysis is left for
further research. 
Appendix A: The Skiba Point

As noted in the main text, the open-loop Nash equilibrium results from
maximizing the welfare objective (a potential function)

$$\int_0^\infty e^{-\rho t}\left[\sum_{i=1}^{n}\ln a_i(t) - c\,x^2(t)\right]dt. \tag{A1}$$

What follows is strongly based on Wagener (1999). 

Define the Hamiltonian function

$$H = e^{-\rho t}\,g(x, a_1, \ldots, a_n) + \mu f(x, a_1, \ldots, a_n), \tag{A2}$$

where

$$g(x, a_1, \ldots, a_n) = \sum_{i=1}^{n}\ln a_i - c x^2, \qquad f(x, a_1, \ldots, a_n) = \sum_{i=1}^{n} a_i - bx + \frac{x^2}{x^2+1}, \tag{A3}$$

and define the current value Hamiltonian function

$$\mathcal{H} = g(x, a_1, \ldots, a_n) + \lambda f(x, a_1, \ldots, a_n), \qquad \lambda = e^{\rho t}\mu. \tag{A4}$$



The maximum principle yields the necessary conditions (8), (9) and (2) with the parameter
c replaced by c/n, which then yields the set of differential equations (2) and (13) for the
open-loop Nash equilibrium with the phase-diagram given in Figure 5a and the stable and
unstable manifolds in Figure 5b. Denote the x-coordinate of the oligotrophic steady-state as
$x_1$ and of the eutrophic steady-state as $x_4$, and denote the range with curls as $[x_2, x_3]$, where
$x_2$ and $x_3$ are the x-coordinates of the intersection points of the outer upper curl and the
outer lower curl with the curve representing the steady-states for x ($f = 0$), respectively.
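Along trajectories, it holds that

$$\frac{dH}{dt} = \frac{\partial H}{\partial x}\frac{dx}{dt} + \frac{\partial H}{\partial \mu}\frac{d\mu}{dt} + \frac{\partial H}{\partial t} = \frac{\partial H}{\partial x}\frac{\partial H}{\partial \mu} - \frac{\partial H}{\partial \mu}\frac{\partial H}{\partial x} + \frac{\partial H}{\partial t} = -\rho e^{-\rho t} g. \tag{A5}$$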



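It follows that

$$\mathcal{H}(0) = H(0) = -\int_0^\infty \frac{dH}{dt}\,dt = \int_0^\infty \rho e^{-\rho t} g\,dt = \rho W. \tag{A6}$$

Furthermore,

$$\frac{dW}{dx} = \frac{dW}{dt}\frac{dt}{dx} = \frac{1}{\rho}\frac{d\mathcal{H}}{dt}\frac{1}{f} = \frac{1}{\rho}\left[\frac{\partial \mathcal{H}}{\partial x}\,f + \left(\rho\lambda - \frac{\partial \mathcal{H}}{\partial x}\right)f\right]\frac{1}{f} = \lambda. \tag{A7}$$

Condition (8) yields

$$\lambda = -\frac{1}{a_i}, \quad i = 1, \ldots, n, \quad\Longrightarrow\quad \lambda = -\frac{n}{a} \quad \Big(a = \sum_{i=1}^{n} a_i\Big). \tag{A8}$$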



The proof of the existence of a unique Skiba point takes four steps. 

1) Suppose the initial condition $x_0$ is in the interior of the range $[x_2, x_3]$.

It is better to jump immediately to the upper trajectory instead of to the same trajectory
some point earlier, or to the lower trajectory instead of to the same trajectory some point
earlier, because the welfare difference

$$\int \frac{dW}{dx}\,dx = \int \lambda\,dx = -\int \frac{n}{a}\,dx < 0. \tag{A9}$$

2) Suppose the initial condition is $x_2$. The choice is either to jump to the upper trajectory
and start at the intersection point with $f = 0$ or to jump to the lower trajectory, by a proper
choice of initial loadings a. Because

$$\frac{dW}{da} = \frac{1}{\rho}\frac{d\mathcal{H}}{da} = \frac{1}{\rho}\sum_{i=1}^{n}\frac{\partial}{\partial a_i}\,(g + \lambda f) = \frac{1}{\rho}\sum_{i=1}^{n}\frac{1}{a_i^2}\,f = \frac{1}{\rho}\,\frac{n^3}{a^2}\,f \tag{A10}$$

and $f < 0$ below the intersection point, the welfare difference between the upper trajectory
and the lower trajectory is negative, so that it is better to jump to the lower trajectory at $x_2$.

3) Suppose the initial condition is $x_3$. The choice is either to jump to the lower trajectory
and start at the intersection point with $f = 0$ or to jump to the upper trajectory. Because
above the intersection point $f > 0$, it follows from equation (A10) that the welfare
difference between the upper trajectory and the lower trajectory is positive, so that it is better to
jump to the upper trajectory at $x_3$.

4) Compare now the upper trajectory leading to the steady-state on the right, with co-state
$\lambda_2$, and the lower trajectory leading to the steady-state on the left, with co-state $\lambda_1$. Denote
the welfare difference as $\Delta W$. Using the results in steps 2-3 and equations (A7)-(A8), it
follows that

$$\Delta W(x_2) < 0, \qquad \Delta W(x_3) > 0, \qquad \frac{d}{dx}\Delta W = \lambda_2 - \lambda_1 > 0. \tag{A11}$$

From (A11) it follows that a point $x_S$ in the interior of the range $[x_2, x_3]$ exists, such that

$$\Delta W(x_S) = 0; \qquad \Delta W(x) < 0,\ x \in [x_2, x_S); \qquad \Delta W(x) > 0,\ x \in (x_S, x_3]. \tag{A12}$$

The point $x_S$ is called a Skiba point and (A12) implies that if the initial amount of
phosphorus $x_0$ is on the left-hand side of the Skiba point, the equilibrium jumps to the lower
trajectory and moves towards the steady-state on the left and if the initial amount of
phosphorus $x_0$ is on the right-hand side of the Skiba point, the equilibrium jumps to the upper
trajectory and moves towards the steady-state on the right.




Multiple Species Boreal Forests - What Faustmann 
Missed* 

ANNE-SOPHIE CREPIN 

The Beijer International Institute of Ecological Economics, The Royal Swedish Academy of 
Sciences, Box 50005, 10405 Stockholm, Sweden (E-mail: asc@beijer.kva.se) 



Abstract. Recent research in natural sciences shows that the dynamics in boreal forests are much 
more complex than what many models traditionally used in forestry economics reflect. This essay 
analyzes some challenges of accounting for such complexity. It shows that the optimal harvesting 
strategy for forest owners is history dependent and for some states of the forest, more than one 
strategy may be optimal. This paper confirms earlier literature on shallow lakes and coral reefs and 
shows that this kind of phenomenon seems much more common than previously thought. They are
[...] of some specific function to model the non-linearity. There are also indications that these results
could be obtained even for resources with concave growth if at least one species with non-linear 
growth affects their dynamics. 

Key words: forestry, multiple steady states, non-linear, Skiba points 



1. Introduction 

Research inspired by Holling (1973) and May (1977) showed that ecosystem 
dynamics were complex: not accounting for them would lead to serious surprises. 
Non-linearity, interactions between species, disturbances, and threshold effects 
were some examples of patterns that play a crucial role in ecosystems’ dynamics. 
What does this imply for resource management? 

Theories for economic management of renewable natural resources have 
recently experienced drastic changes due to the model for shallow lakes that 
Scheffer (1998) and Carpenter and Cottingham (1997) produced independently of
one another. This model accounts for the possibility that such lakes may flip between
a clear and a turbid state. Brock and Starrett (2003) gave a complete treatment of
the problem of optimal management of shallow lakes. Mäler, Xepapadeas and de
Zeeuw (2003) provided a dynamic economic analysis of shallow lakes managed 
under common property and developed a method to solve such a differential game. 
Wagener (2003) gave a local criterion that ensured the existence of Skiba points 
(Skiba 1978) - initial states with more than one optimal path - when discount rates 
were small. 

* This paper is a shorter version of Chapter 2 in Crepin (2002). 
All these contributions are based on the study of shallow lakes and could for
that reason be criticized for their narrow applicability. After all, shallow lakes are
not a dominant ecosystem on our planet. Crepin (2002, Ch. 1) showed that similar
results - existence of multiple interior steady states and Skiba points - could be
obtained for coral reef fisheries. So many results from the fisheries literature (Gordon
1954; Scott 1955; Clark 1973; Clark 1976) do not necessarily hold in the presence
of threshold effects like those in coral reefs. Coral reefs cover a small area on our planet
but they play a key role as a nursery for fish living in the open seas (Hughes et
al. 2003). So these results may have larger consequences for human beings than
previously thought.

Another criticism against the shallow lake literature, which applies to the coral
reef article as well, is that non-linearity is modeled in the same way in all the
articles. Each of them uses a specific sigmoid function to model threshold effects.
If we are unlucky, the results obtained in the shallow lake and coral reef literature
could be due to this function's special features, which may have little in common
with real phenomena.

This article takes up both these criticisms. It shows that the results from the
shallow lake literature can be obtained even for boreal forests - terrestrial
ecosystems, also called taiga, that occupy a wide belt around the arctic circle in the
northern hemisphere. It also shows that these results do not depend on the use
of the specific sigmoid function.

Extensive literature exists on ecosystem modeling and management where
several species interact. In particular, Pastor and others described the complexity
of boreal forest ecosystems (Pastor and Mladenoff 1992; Pastor et al. 1996; Danell
et al. 1998). They pointed out the importance of the interaction of species and
non-linearities. Pastor et al. captured the most important dynamics in boreal forests
by using a system of three differential equations that represent the three-species'
dynamics. This paper uses a slightly modified version of Pastor et al.'s model. The
model is used to analyze the challenges that ecosystem complexity implies when
calculating management rules for forestry.

This paper derives some optimal management rules to guide forest owners or
decision makers who harvest several species and also value the standing forest for
recreational purposes. The harvesting rules are continuous, which means that the
decision maker calculates how much it is optimal to harvest at each point in time.
The boreal forests are considered as any other ecosystem that produces renewable
resources. So an approach similar to that in the fisheries literature (Clark 1976) can be
used to find optimal harvesting paths. This is not usual in the forestry literature.
Models of forestry management - initiated by Faustmann (1849), Pressler (1860),
and Ohlin (1921) - typically assume that it is optimal to harvest the whole stand
at discrete time intervals. These models result in rules that determine when the
whole stand of one specific species should be harvested. Bergland, Ready and
Romstad (forthcoming) recently made an interesting contribution to this literature.
They studied optimal management of moose and pine in Norway and calculated an
optimal rotation period and optimal moose harvest in a model in which there was
feedback between moose and pine. Crepin (2002, Ch. 2) determined the specific 
conditions required for whole stand harvesting to be optimal and calculated optimal 
harvesting rules in the special case studied here. The optimal rotation period
typically has a different length, which depends on the presence of other species in the
forest. The non-convexity of the growth function for pine implies that several 
rotation periods may be optimal. 

Section 2 discusses modelling issues, presents the ecosystem model of a boreal 
forest, and analyzes its dynamic properties. Section 3 models different harvesting 
regimes. Section 4 presents the computer simulations. Section 5 presents the 
conclusion. 

2. A Three-species Boreal-forest Model

A good ecosystem model should grasp the most important dynamics in the real 
ecosystem and still be simple enough to give useful understanding. What character- 
izes boreal forests? Compared to many other ecosystems (coral reefs, rainforests), 
most boreal forests host few species. A small number of these species appear
in large populations and dominate the forests. Boreal tree species are coniferous 
trees that include spruce and pine and some hardwood species such as aspen 
and birch. They give the forests their special look and influence their dynamics. 
Large browsers (moose, elk) can influence boreal forests’ dynamics because they 
feed on the trees and can damage them. Boreal forests also host various types of 
lower vegetation, which vary depending on humidity and nutrients levels in the 
soil (Bernes 1994; Danell et al. 1998; Pastor et al. 1997). The model used here 
describes only the dynamics of conifers, deciduous trees and large browsers.
Similarly to Pastor et al. (1997), the influence of other species on the forests' dynamics
is neglected. This paper uses two models for a boreal forest: a very general model 
and a specific model, which is useful for computer simulations. 

2.1. GENERAL MODEL 

Murray (1993) and Gurney and Nisbet (1998) presented methods to build and 
analyze ecosystem models. The very general three-species model (SYS) could 
represent the dynamics of boreal forests: 

$$\begin{aligned} \dot x &= G_x(x, y, z) \\ \dot y &= G_y(x, y, z) \\ \dot z &= G_z(x, y, z) \end{aligned} \tag{SYS}$$

where x, y, and z are vectors that represent categories of browsers (moose),
deciduous trees (birch), and conifers (pine) biomasses, respectively. $G_i(x, y, z)$ is
the natural growth of species or category $i \in \{x, y, z\}$ in the absence of harvesting.
Such a general model can embed almost any kind of specific dynamics among 
three species groups. This is useful to model age or space, for example. Negative
biomass values are impossible and impose the restrictions:

$$x \ge 0, \quad y \ge 0 \quad \text{and} \quad z \ge 0. \tag{1}$$

This is equivalent to imposing that $G_i(x, y, z)$ must be non-negative when species
i is extinct ($i = 0$). Assume $x_0$, $y_0$, and $z_0$ are the initial stocks of species at time
t = 0. If none of the restrictions (1) are binding, and for every $i \in \{x, y, z\}$,
$G_i(x, y, z)$ and its derivatives with respect to x, y, and z are continuous, then SYS has
a unique solution:

$$\begin{aligned} x(t) &= \varphi_x(x_0, y_0, z_0, t) \\ y(t) &= \varphi_y(x_0, y_0, z_0, t) \\ z(t) &= \varphi_z(x_0, y_0, z_0, t) \end{aligned} \tag{2}$$

In section 3, the SYS model is used to calculate general forestry manage- 
ment rules. Computer simulations require a more precise specification of forests’ 
dynamics. Ideally, one would like to represent forest ecosystems as accurately as 
possible, including all relevant species, their ages, and spatial distribution. Some 
simplifications are necessary because such a representation would be rather diffi- 
cult to work with. This paper focuses on the effects of species’ interactions so the 
model used does not account for age and space. 

2.2. SPECIFIC MODEL 

Pastor et al. (1997) presented a simplified boreal-forest model in which
conifers, deciduous trees, and herbivores interacted. A slightly modified version
is used here, where pine, birch, and moose are the three interacting species. In
Pastor's model, species can move in space. This is not assumed here, and the model
represents one homogeneous piece of land. Removing the spatial dimension from
Pastor's model implies that conifers grow in an uncontrolled way, which is not
very realistic. Still, conifers tend to come rather late in the succession of species
and are invasive (Bernes 1994). A convex-concave growth function for conifers is
one way to represent such a feature.

For $i, j \in \{x, y, z\}$, let $r_i$ be species i's growth rate and $a_{ij}$ be the interaction
coefficient of species j on i. Then SYS can be rewritten with growth functions
defined by (3). This specific model is called SYS1.

$$\begin{aligned} G_x(x, y, z) &= x - x^2 + a_{xy}\,xy + a_{xz}\,xz \\ G_y(x, y, z) &= r_y y - y^2 - a_{yx}\,xy - a_{yz}\,z \\ G_z(x, y, z) &= r_z z^2 - z^3 - a_{zx}\,xz - a_{zy}\,y \end{aligned} \tag{3}$$

In SYS1, $r_x = a_{xx} = a_{yy} = a_{zz} = 1$. Crepin (2002, Ch. 2) showed that models
with general values of these parameters had dynamic properties similar to SYS1's
after parameter calibration. So studying SYS1 produces necessary information on
more general models.

Moose feed on birch and pine, so both tree species have a positive effect
on moose biomass, which is proportional to tree stocks ($a_{xy}xy$ and $a_{xz}xz$). The
corresponding effect of moose on tree biomass is negative ($-a_{yx}xy$ and $-a_{zx}xz$).
The negative term $-x^2$ describes the crowding effect that occurs when the moose
population becomes too large. The negative term $-y^2$ describes that birch is shade
intolerant. When the density is too high, birch stop multiplying because new plants
cannot get enough light. These terms are in quadratic form because in the absence
of any other species moose and birch would have logistic growth by assumption
and $i/r_i = 1$, $i \in \{x, y\}$, would then be the carrying capacity for species i.

In contrast, pine exhibits convex-concave growth, which is modeled using the
function $z^2(r_z - z)$. When pine biomass is small, young pines establish better
with increasing biomass, so growth is convex because the term $r_z z^2$
dominates. When the population becomes larger, competition arises and growth
becomes concave because the term $-z^3$ dominates. When carrying capacity
($z = r_z$) is reached, more pine leads to negative growth. Any function of the
form $z^q(r_z - z)$ has similar dynamic properties. So why specify $q = 2$? First,
this is convenient because we avoid large exponents or an extra parameter q.
Second, q influences the size of the domain on which the growth function for z
is convex: larger values of q imply a larger domain in which pine growth is
convex. This means that if we choose $q = 2$, we depart just a little from the
traditional concavity assumption. So if the traditional results do not hold for
$q = 2$, they are likely to hold even less for larger values of q. The terms
$a_{yz}z$ and $a_{zy}y$ represent the effects of competition between species of
trees.
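
The convex-concave shape of pine growth can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, species interactions are ignored, and $r_z = 2$ is the benchmark value that appears later in Table III. It evaluates the second derivative of $z^q(r_z - z)$ and the biomass at which growth switches from convex to concave.

```python
def pine_growth(z, r_z=2.0, q=2):
    """Own growth of pine, z**q * (r_z - z), ignoring other species."""
    return z**q * (r_z - z)


def pine_curvature(z, r_z=2.0, q=2):
    """Second derivative of z**q * (r_z - z) with respect to z."""
    return q * (q - 1) * r_z * z**(q - 2) - (q + 1) * q * z**(q - 1)


def inflection_point(r_z=2.0, q=2):
    """Biomass at which growth switches from convex to concave."""
    return r_z * (q - 1) / (q + 1)
```

With $q = 2$ the switch occurs at $r_z/3$, and a larger q moves it to the right, which is the sense in which $q = 2$ departs least from the concavity assumption.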

Figure 1 shows what a species’ growth looks like when species interaction is not
accounted for. Note that $G_y(x, 0, z)$ and $G_z(x, y, 0)$ are negative, so the
constraints (1) may be binding for some initial points. Crepin (2002, Ch. 2)
describes what happens when constraints are binding.

While a detailed analysis of SYS1’s dynamics can be found in Crepin (2002,
Ch. 2), the important features are summarized here. Depending on parameter
values, SYS1 has up to 15 feasible steady states. Their characteristics are in
Table I.³

Figure 2 shows a three-dimensional phase diagram of SYS1. Table II summarizes
the directions of motion in the different regions separated by the manifolds
$P_{x1}$ ($\dot{x} = 0$, $x \neq 0$), $P_y$ ($\dot{y} = 0$), and $P_z$
($\dot{z} = 0$).

The phase space is also divided into regions so that trajectories that start 
anywhere in one region will end up in the same stable equilibrium. The mani- 
folds called separatrices (Kuznetsov 1995) separate these different regions. The 
separatrices are difficult to locate in the three-dimensional phase space, but Figure 3 
shows the phase diagram when moose, for example, have disappeared. The dotted 
curved line shows the separatrices’ approximate location in this special case. 

Figure 1. Growth patterns for moose (bold line), birch (dashed line), and pine (thin line). Vertical axis: growth.



Table I. Steady states properties

Ecosystem       Steady state                                                                                      Dynamic properties
extinction      $S0 = (0, 0, 0)$                                                                                  one unstable or saddle
one species     $S1_x = (1, 0, 0)$                                                                                one stable or saddle
                $S1_y = (0, r_y, 0)$                                                                              one stable or saddle
                $S1_z = (0, 0, r_z)$                                                                              one stable
two species     …                                                                                                 one stable, two saddle, one unstable
                $S2_\varphi = (1 + a_{xz}\varphi, 0, \varphi)$                                                    one stable, one saddle
                $\left(\frac{a_{xy}r_y + 1}{a_{xy}a_{yx} + 1}, \frac{r_y - a_{yx}}{a_{xy}a_{yx} + 1}, 0\right)$   one stable
three species   $S3 = (X(k), Y(k), k)$                                                                            one stable, three saddle



Table II. Directions of motion in different regions

Region                                           $\dot{x}$   $\dot{y}$   $\dot{z}$
I: below $P_{x1}$, $P_y$ and $P_z$               > 0         > 0         > 0
II: below $P_{x1}$ and $P_y$, above $P_z$        > 0         > 0         < 0
III: below $P_{x1}$ and $P_z$, above $P_y$       > 0         < 0         > 0
IV: below $P_y$ and $P_z$, above $P_{x1}$        < 0         > 0         > 0
V: below $P_z$, above $P_{x1}$ and $P_y$         < 0         < 0         > 0
VI: below $P_{x1}$, above $P_y$ and $P_z$        > 0         < 0         < 0
VII: below $P_y$, above $P_{x1}$ and $P_z$       < 0         > 0         < 0
VIII: above $P_{x1}$, $P_y$ and $P_z$            < 0         < 0         < 0

Figure 2. Phase diagram.




Steady-state simulations with sets of arbitrarily chosen parameters show the
fold bifurcations⁴ that might occur. For example, when the birch growth rate
varies, six fold bifurcations can be distinguished for values of $r_y$ around
0.5, 0.65, 1.2, 1.45, 1.84, and 1.86. Figure 4 represents pine biomass in a
steady state for different birch growth-rate values. It shows clearly the fold
bifurcations that occur. For low growth rates, no feasible interior steady state
is stable: birch cannot maintain itself and becomes extinct. Similar results are
obtained when the other parameters vary.
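
A fold bifurcation can be seen in closed form in one corner of SYS1. The sketch below is an illustration under stated assumptions, not the simulation reported here: it looks at the moose-pine subsystem with birch absent ($y = 0$), varies the pine growth rate $r_z$ rather than $r_y$, and uses the benchmark values $a_{xz} = 0.5$ and $a_{zx} = 0.05$ from Table III. The function names are hypothetical.

```python
import math


def moose_pine_steady_states(r_z, a_xz=0.5, a_zx=0.05):
    """Interior steady states (x, z) of the moose-pine subsystem (birch absent).

    With y = 0, the conditions G_x = 0 and G_z = 0 reduce to
    x = 1 + a_xz * z and z**2 - (r_z - a_zx * a_xz) * z + a_zx = 0.
    """
    b = r_z - a_zx * a_xz
    disc = b * b - 4.0 * a_zx
    if disc < 0:
        return []  # past the fold: no real roots
    roots = {(b - math.sqrt(disc)) / 2.0, (b + math.sqrt(disc)) / 2.0}
    return sorted((1.0 + a_xz * z, z) for z in roots if z > 0)


def fold_value(a_xz=0.5, a_zx=0.05):
    """Pine growth rate at which the two steady states collide."""
    return a_zx * a_xz + 2.0 * math.sqrt(a_zx)
```

Above the fold value there are two such steady states; as $r_z$ falls toward it they approach each other, and below it both disappear.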

Figure 4. Fold bifurcations (pine). Pine population for different birch growth rates (horizontal axis: birch growth rate).



Computation of the Lyapunov exponents (Lyapunov 1892) for the parameter
values tested showed no occurrence of limit cycles or chaos. Nevertheless,
Takeuchi (1996) studied a slightly different model with two competing prey and
one predator and found a Hopf bifurcation⁵ that leads to periodic orbits. He
also found that the three species could coexist in chaotic motion for some
parameter values that correspond to a large effect of predation. Takeuchi’s
results show that caution is required here: we can only rule out the existence
of limit cycles and chaos for the parameter values tested.

These results show that the ecosystem is history dependent: its long-term state 
depends on the initial state. Furthermore, external shocks that affect the variable 
stocks or the parameter values influence the ecosystem’s dynamic properties. This 
could lead to crossing a separatrix so that an external shock can drastically change 
the ecosystem’s long-run equilibrium. Harvesting and hunting are examples of such 
external shocks. The rest of the paper examines their effects. 
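
History dependence can be illustrated with a deliberately reduced version of the pine equation. The sketch below is not SYS1: it assumes moose are absent and holds birch fixed at a hypothetical level of 0.5, so that pine follows $\dot{z} = r_z z^2 - z^3 - a_{zy}y$ with the benchmark values $r_z = 2$ and $a_{zy} = 0.7$ from Table III. The function name and the forward-Euler scheme are illustrative choices.

```python
def simulate_pine(z0, birch=0.5, r_z=2.0, a_zy=0.7, dt=0.01, t_end=50.0):
    """Forward-Euler path of dz/dt = r_z*z**2 - z**3 - a_zy*birch, with z >= 0."""
    z = z0
    for _ in range(int(t_end / dt)):
        # The max(...) enforces the non-negativity constraint on the stock.
        z = max(0.0, z + dt * (r_z * z**2 - z**3 - a_zy * birch))
    return z
```

Starting below the threshold (roughly 0.48 for these values) the stock collapses to zero, while starting above it the stock settles near 1.90. A shock that moves the stock across the threshold therefore changes the long-run outcome.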



3. General Management Rules 

What happens when harvesting is introduced in ecosystem SYS? Let
$h = (h_i)_{i \in \{x, y, z\}}$ be a vector of harvests at time t. To begin with,
assume that this vector is arbitrary. The SYS system is transformed into:

$$
\begin{aligned}
\dot{x} &= G_x(x, y, z) - h_x \\
\dot{y} &= G_y(x, y, z) - h_y \\
\dot{z} &= G_z(x, y, z) - h_z \\
x &\ge 0,\ y \ge 0,\ z \ge 0
\end{aligned}
\tag{4}
$$

It is easy to verify that harvesting affects the separatrices’ location and thereby
the stable states’ basins of attraction. Harvesting may also cause bifurcations;
the number of equilibria and their dynamic properties can then differ from the
unexploited ecosystem case.
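
The effect of harvesting on a threshold can be sketched in one dimension. The example below is a simplification, not the model analyzed here: it assumes pine alone, harvested at a constant hypothetical rate h, so that $\dot{z} = r_z z^2 - z^3 - h$ with $r_z = 2$. The function name is illustrative. It locates by bisection the lower, unstable root that separates collapse from recovery.

```python
def pine_threshold(h, r_z=2.0, tol=1e-10):
    """Lower (unstable) root of r_z*z**2 - z**3 - h = 0, or None past the fold."""
    z_peak = 2.0 * r_z / 3.0  # biomass at which own growth is maximal
    if h <= 0 or h >= r_z * z_peak**2 - z_peak**3:
        return None
    lo, hi = 0.0, z_peak  # net growth is negative at lo and positive at hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if r_z * mid**2 - mid**3 - h < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

In this sketch a larger harvest raises the threshold, which shrinks the set of initial stocks that recover, and a harvest above the maximal growth $4r_z^3/27$ removes the positive steady states altogether.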

Assume the initial stocks of the species at time $t = 0$ are $x_0$, $y_0$, and
$z_0$. If none of the species becomes extinct, then (4) has a unique solution:

$$
\begin{aligned}
x(t) &= \phi_x(x_0, y_0, z_0, t) \\
y(t) &= \phi_y(x_0, y_0, z_0, t) \\
z(t) &= \phi_z(x_0, y_0, z_0, t)
\end{aligned}
\tag{5}
$$

Given that owners harvest forest species to increase their welfare, the next step is 
to decide what harvesting rules maximize welfare in each period. 

Assume forest owners account for the benefits that they can continuously
extract from all of the forest’s species. To maximize the forest’s net benefits,
they want to find out how much of each species to harvest at every point in time.
These benefits are harvesting profits and the forest’s net environmental and
recreational values. Modeling recreational and environmental values requires
that species stocks enter the maximized objective function directly as state
variables.

Let $\Omega_i(h_i)$ represent profits from harvesting species i and
$\Omega_e(x, y, z)$ represent net benefits from environmental and recreational
services. Functions $\Omega_i$ and $\Omega_e$ are assumed to be concave. Assume
further that the owners give different positive weights $\kappa_x$, $\kappa_y$,
$\kappa_z$, and $\kappa_e$ to the respective net forest benefits. At time t, the
total net benefits are $B(h, x, y, z) = \kappa_x\Omega_x(h_x) +
\kappa_y\Omega_y(h_y) + \kappa_z\Omega_z(h_z) + \kappa_e\Omega_e(x, y, z)$.

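Owners face the problem:

$$
\begin{aligned}
\max_{h}\ & \int_0^{+\infty} B(h, x, y, z)\, e^{-\rho t}\, dt \\
\text{s.t.}\ & \dot{x} = G_x(x, y, z) - h_x \\
& \dot{y} = G_y(x, y, z) - h_y \\
& \dot{z} = G_z(x, y, z) - h_z \\
& x \ge 0,\ y \ge 0 \text{ and } z \ge 0
\end{aligned}
\tag{6}
$$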

Pontryagin et al. (1964) developed methods to solve such optimal control problems.
This paper uses the method found in Arrow and Kurz (1970, Ch. 2) to solve such
problems when there are non-negativity constraints on state variables. Hestenes
(1966) and Seierstad and Sydsæter (1987) have developed similar methods.

Let $h^*(t) = (h_i^*(t))_{i \in \{x, y, z\}}$ represent harvest choices that are
admissible solutions for problem (6). If the constraint qualification (Kuhn and
Tucker 1951) holds, then there exist functions of time $\lambda_i(t)$ such that,
for each t, with $\mathcal{H}$ the current-value Hamiltonian and $\mathcal{L}$
the Lagrange function:

$$
\mathcal{H}(x, y, z, h, \lambda, t) = \sum_i \left(\kappa_i\Omega_i(h_i) + \lambda_i\left(G_i(x, y, z) - h_i\right)\right) + \kappa_e\Omega_e(x, y, z) \tag{7}
$$

$$
\mathcal{L}(x, y, z, h, \lambda, \mu, t) = \mathcal{H}(x, y, z, h, \lambda, t) + \sum_i \mu_i\left(G_i(x, y, z) - h_i\right) \tag{8}
$$

Then $h^*(t)$ maximizes $\mathcal{H}(x, y, z, h, \lambda, t)$ subject to the
constraints $G_i(x, y, z) - h_i \ge 0$ for all $i \in \{x, y, z\}$ for which
$i(t) = 0$. Further, $\dot{\lambda}_i = \rho\lambda_i - \partial\mathcal{L}/\partial i$,
evaluated at $i = i(t)$, $h = h^*(t)$, $\lambda = \lambda(t)$. The Lagrange
multipliers $\mu_i$ must be such that for all i,
$\partial\mathcal{L}/\partial h_i = 0$ for $(x, y, z) = (x(t), y(t), z(t))$,
$h = h^*(t)$, $\lambda = \lambda(t)$, and $\mu_i(t)\,i(t) = 0$,
$\mu_i(t)(G_i(x, y, z) - h_i) = 0$. The necessary conditions for $h^*(t)$ to be
optimal amount to:

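1) the equations of motion for the exploited ecosystem,

$$
\begin{aligned}
\dot{x} &= G_x(x, y, z) - h_x \\
\dot{y} &= G_y(x, y, z) - h_y \\
\dot{z} &= G_z(x, y, z) - h_z
\end{aligned}
\tag{9}
$$

2) the necessary conditions for optimal harvest, $\forall i \in \{x, y, z\}$,

$$
\kappa_i \frac{d\Omega_i(h_i^*)}{dh_i} - \lambda_i - \mu_i = 0 \quad \text{or} \quad h_i^* = 0 \tag{10}
$$

3) the shadow price equations for each species, $\forall j \in \{x, y, z\}$,

$$
\dot{\lambda}_j = \rho\lambda_j - \kappa_e \frac{\partial\Omega_e(x, y, z)}{\partial j} - \sum_{i \in \{x, y, z\}} (\lambda_i + \mu_i) \frac{\partial G_i(x, y, z)}{\partial j} \tag{11}
$$

4) the conditions for the multipliers’ non-negativity, $\forall i \in \{x, y, z\}$,

$$
\mu_i(t)\, i(t) \ge 0, \qquad \mu_i(t)\left(G_i(x, y, z) - h_i\right) = 0 \tag{12}
$$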



Conditions (10) are sufficient for optimal harvests because the profits from
harvests are additively separable and concave in each harvest, which implies that
$\mathcal{H}$ is concave in harvests. Proposition 1 follows directly from
equation (10):



Proposition 1

The optimal size of each species’ harvest is such that the marginal value from
harvesting more of the species equals the marginal value of retaining more of it in
the ecosystem.



Assume that the equation system (10) has a solution. For $i \in \{x, y, z\}$, the
solution is unique because the profit functions are concave, and it has the form
$h_i^* = \psi_i(\lambda_i)$. So the optimal harvests $h^*$ are:

$$
h_i^* = \psi_i(\lambda_i) \quad \text{or} \quad h_i^* = 0 \tag{13}
$$

Conditions (11) imply that in a steady state, for all $j \in \{x, y, z\}$,

$$
\rho\lambda_j = \kappa_e \frac{\partial\Omega_e(x, y, z)}{\partial j} + \sum_{i \in \{x, y, z\}} (\lambda_i + \mu_i) \frac{\partial G_i(x, y, z)}{\partial j}
$$

Proposition 2

In a steady state, the interest on a species’ marginal value in the ecosystem equals
the species’ marginal environmental benefit plus the species’ marginal benefit in
maintaining its own and other species’ stock.



Together, propositions 1 and 2 imply that environmental benefits and other species’
stocks must affect harvest size. Whether the harvest is higher or lower
depends on the species’ effects on its own and other species’ growth rates, and on
the environmental benefits.

If $\mathcal{H}(x, y, z, \lambda, t) = \max_h \mathcal{H}(x, y, z, h, \lambda, t)$
is a concave function of (x, y, z) for given $\lambda$ and t, then any policy that
satisfies conditions (9)-(12) and the transversality conditions (14) is optimal.
Arrow and Kurz (1970) provide these sufficiency conditions for infinite horizon
problems.

$$
\lim_{t \to \infty} e^{-\rho t}\lambda_i(t) \ge 0, \qquad \lim_{t \to \infty} e^{-\rho t}\lambda_i(t)\, i(t) = 0 \tag{14}
$$

Let $\Delta_r(x, y, z)$ be the principal minor of order r in the Hessian for
$\mathcal{H}(x, y, z, \lambda, t)$. The maximized Hamiltonian is concave in
(x, y, z) if and only if, for all points (x, y, z) and for all $\Delta_r$,
$(-1)^r \Delta_r(x, y, z) > 0$ for $r = 1, 2, 3$. If this condition does not
hold, the sufficiency conditions for an optimum are not satisfied, and there may
be several, one, or no optimal solutions. If there is more than one candidate
solution, the benefits along the different solution paths may have to be
compared to determine which one is optimal.

When the forest is optimally exploited, two cases can be distinguished and must
be analyzed separately.

— Case 1: no species ever becomes extinct.

— Case 2: at least one of the species becomes extinct at some point $t_1$ in time.
This case is a bit complicated because the analysis differs depending on which
species disappears first. Crepin (2002, Ch. 2) analyzed this case when SYS1
was optimized.

If no species becomes extinct, the exploited system follows the equations
of motion given by (15):



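$$
\begin{aligned}
\dot{x} &= G_x(x, y, z) - \psi_x(\lambda_x) \\
\dot{y} &= G_y(x, y, z) - \psi_y(\lambda_y) \\
\dot{z} &= G_z(x, y, z) - \psi_z(\lambda_z) \\
\dot{\lambda}_x &= \rho\lambda_x - \kappa_e \frac{\partial\Omega_e(x, y, z)}{\partial x} - \sum_{i \in \{x, y, z\}} \lambda_i \frac{\partial G_i(x, y, z)}{\partial x} \\
\dot{\lambda}_y &= \rho\lambda_y - \kappa_e \frac{\partial\Omega_e(x, y, z)}{\partial y} - \sum_{i \in \{x, y, z\}} \lambda_i \frac{\partial G_i(x, y, z)}{\partial y} \\
\dot{\lambda}_z &= \rho\lambda_z - \kappa_e \frac{\partial\Omega_e(x, y, z)}{\partial z} - \sum_{i \in \{x, y, z\}} \lambda_i \frac{\partial G_i(x, y, z)}{\partial z}
\end{aligned}
\tag{15}
$$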

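Brock and Malliaris (1989) describe methods to study such dynamic systems.
Assume the initial species stocks are $x_0$, $y_0$, and $z_0$ at $t = 0$ and the
initial shadow prices are $\lambda_{x_0}$, $\lambda_{y_0}$, and $\lambda_{z_0}$.
Then if the right-hand side of system (15) satisfies the Lipschitz condition (see
Brock and Malliaris 1989), the system has a unique solution defined for every
species $i \in \{x, y, z\}$ by:⁶

$$
\begin{aligned}
i^*(t) &= \Phi_i(x_0, y_0, z_0, \lambda_{x_0}, \lambda_{y_0}, \lambda_{z_0}, t) \\
\lambda_i^*(t) &= \Phi_{\lambda_i}(x_0, y_0, z_0, \lambda_{x_0}, \lambda_{y_0}, \lambda_{z_0}, t)
\end{aligned}
$$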

Assume there is an optimal solution and that the system (15) has at least one
steady state.⁷ Crepin (2002, Appendix) then showed that the eigenvalues at such a
steady state come in pairs $a$, $\rho - a$. So the saddle-point properties proven
in Kurz (1968) remain, even when several steady states exist and the Hamiltonian
is not concave. Proposition 3 follows directly:



Proposition 3

Suppose $\rho > 0$. Then, in the neighborhood of a steady state, the system (15) is
either totally unstable or has the instability characterized by the saddle-point
property.



If the system has several steady states, then for each of them there is a limit
value for $\rho$, say $\rho_s$, under which the steady state exhibits a local
saddle-path property or has eigenvalues equal to zero, and above which the steady
state is locally unstable. This produces a series $(\rho_s)$ of threshold values
for $\rho$. Corollary 4 follows directly.

Corollary 4

Suppose the system (15) has several steady states. Let
$\underline{\rho} = \min_s(\rho_s)$ and $\overline{\rho} = \max_s(\rho_s)$. If
$\rho < \underline{\rho}$, there is a local saddle path that leads toward each
steady state. If $\rho > \overline{\rho}$, all of the steady states are locally
unstable. If $\underline{\rho} < \rho < \overline{\rho}$, some steady states are
locally unstable while others have a local saddle path.



Note that often in such problems $\underline{\rho} = 0$ and, for
$0 < \rho < \overline{\rho}$, there is an odd number of steady states, which come
in consecutive pairs of saddle points and unstable states (Birkhoff 1927).
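
The dichotomy between saddle points and totally unstable states is easiest to see with one state and one shadow price. The sketch below is a toy illustration, not the three-species system: it assumes a linearized system whose Jacobian is $[[a, b], [c, \rho - a]]$, with hypothetical entries a, b, c taken to be independent of $\rho$. Its trace equals $\rho$, so the two eigenvalues e and $\rho - e$ form exactly one pair of the kind described above.

```python
import cmath


def classify_steady_state(a, b, c, rho):
    """Classify a steady state of a one-state, one-costate system.

    The Jacobian is assumed to be [[a, b], [c, rho - a]], so its trace is rho
    and its two eigenvalues e and rho - e sum to the discount rate.
    """
    det = a * (rho - a) - b * c
    root = cmath.sqrt(rho * rho / 4.0 - det)
    eigenvalues = (rho / 2.0 + root, rho / 2.0 - root)
    if det < 0:
        kind = "saddle"  # one negative and one positive real eigenvalue
    elif det == 0:
        kind = "zero eigenvalue"
    else:
        kind = "unstable"  # both real parts are positive when rho > 0
    return kind, eigenvalues
```

For example, with a = 0.1, b = 1 and c = 0.005 the determinant changes sign at $\rho = 0.15$: the steady state is a saddle for smaller discount rates and unstable for larger ones, which mimics a single threshold value.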

The occurrence of several steady states implies that there is no obvious optimal
trajectory from a given starting point. Candidate trajectories toward different
equilibria must be compared to determine which one is optimal. Relation (16)
follows from the classic Hamilton-Jacobi result and makes welfare comparisons
possible when there is more than one candidate-optimal steady state.

$$
\int_0^{\infty} B(h^*, x, y, z)\, e^{-\rho t}\, dt = \frac{1}{\rho}\, \mathcal{H}(x_0, y_0, z_0, \lambda_0) \tag{16}
$$

This relation can also be used to locate Skiba points, which are initial states with
more than one optimal path (Beyn et al. 2001). As Deissenberg et al. (2001) point
out, if strict concavity does not hold, Skiba points generically do not
coincide with the unstable steady states, and the latter are not necessarily optimal.
Wagener (2003) showed that for systems with one state and one control, if there 
was a cusp bifurcation (see Kuznetsov 1995 for a definition) when the discount rate 
was zero, then for small positive discount rates, the system had a Skiba point. This 
result has not yet been generalized to higher dimensions. 
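
Welfare comparisons between candidate trajectories come down to evaluating discounted integrals. The sketch below is a generic numerical helper, not the procedure used in this paper: the function name, the trapezoid rule, and the truncation at a finite horizon are all illustrative assumptions.

```python
import math


def discounted_welfare(benefit, rho, t_end, dt=0.1):
    """Trapezoid-rule approximation of the integral of benefit(t)*exp(-rho*t)."""
    n = int(round(t_end / dt))
    total = 0.0
    for k in range(n):
        t0, t1 = k * dt, (k + 1) * dt
        f0 = benefit(t0) * math.exp(-rho * t0)
        f1 = benefit(t1) * math.exp(-rho * t1)
        total += 0.5 * (f0 + f1) * dt
    return total
```

A simple check uses a path that starts and stays at a steady state. The stocks do not move, so the Hamiltonian equals the constant benefit B and relation (16) reduces to $B/\rho$; with B = 1 and $\rho = 0.02$ the helper should return a value close to 50.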

Section 4 simulates a specific model of a boreal forest with continuous,
multiple-use harvesting.

4. Simulations with a Specific Model

The simulations aim to answer the following questions: How many steady states 
exist and are they feasible? What is the optimal path in the exploited forest? What 
are the dynamics at each steady state? What do the dynamics of the system look 
like when they are not in a steady state? What is the basin of attraction of the 
steady states where no species become extinct? What happens in boundary points? 
Do bifurcations occur when parameters vary? 

Kuznetsov (1995), Judd (1998), and Beyn et al. (2001) describe several useful
numerical methods. MATHCAD⁸ was used to simulate steady states and the dynamics
in their neighborhood.⁹ Steady-state analysis is not enough to obtain a good
picture of the dynamics of the systems of differential equations studied here.
Each system may have very complex dynamics outside of the steady states. Limit
cycles¹⁰ or chaotic attractors¹¹ may very well be present. Simulations using
DYNAMICS¹² (Nusse and Yorke 1997) help picture the systems’ dynamics
outside steady states. In particular, this program can explore the system for
limit cycles and chaos. MATLAB¹³ was used to solve two-point boundary value
problems and to locate Skiba points in the optimized system using the method
developed in Beyn et al. (2001).

4.1. MULTIPLE-USE SPECIFIC MODEL 

Let $q_i$ and $c_i$ be positive constants that represent the unit price and a
cost parameter, respectively, for species $i \in \{x, y, z\}$. Then the profit
from harvesting species i can be expressed as $\Omega_i(h_i) = q_i h_i - c_i h_i^2$.
Let $\alpha_i \equiv \kappa_i c_i$ and $p_i \equiv q_i / c_i$; then the weighted
profit from harvesting species i is $\kappa_i\Omega_i(h_i) = \alpha_i h_i(p_i - h_i)$.
This function has a convex cost part ($\alpha_i h_i^2$) and is concave in harvest.¹⁴

Let $\xi$, $\eta$, and $\theta$ be constant weights associated with species x, y,
and z respectively, with $\xi, \eta, \theta \in (0, 1)$ and
$\xi + \eta + \theta \le 1$. Environmental and recreational benefits from the
forest are then $\Omega_e(x, y, z) = x^{\xi} y^{\eta} z^{\theta}$.¹⁵ This function
is increasing and concave in all species’ stocks, and the environmental benefits
from the forest are zero as soon as one species becomes extinct. This is a strong
assumption, but the functional form still accounts for important characteristics
of environmental and recreational benefits: they increase with the number of
species and with population sizes.

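The multiple-use problem from section 3 is now transformed into:

$$
\begin{aligned}
\max_{h}\ & \int_0^{+\infty} \Big[\sum_i \alpha_i h_i (p_i - h_i) + \kappa_e x^{\xi} y^{\eta} z^{\theta}\Big] e^{-\rho t}\, dt \\
\text{s.t.}\ & \dot{x} = x - x^2 + a_{xy}xy + a_{xz}xz - h_x \\
& \dot{y} = r_y y - y^2 - a_{yx}xy - a_{yz}z - h_y \\
& \dot{z} = r_z z^2 - z^3 - a_{zx}xz - a_{zy}y - h_z
\end{aligned}
$$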

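The Lagrange function is still given by (8), and the Hamiltonian (7) is rewritten
with appropriate functional forms:

$$
\begin{aligned}
\mathcal{H}(x, y, z, h, \lambda, t) ={}& \sum_i \alpha_i h_i (p_i - h_i) + \kappa_e x^{\xi} y^{\eta} z^{\theta} \\
& + \lambda_x \left(x - x^2 + a_{xy}xy + a_{xz}xz - h_x\right) \\
& + \lambda_y \left(r_y y - y^2 - a_{yx}xy - a_{yz}z - h_y\right) \\
& + \lambda_z \left(r_z z^2 - z^3 - a_{zx}xz - a_{zy}y - h_z\right)
\end{aligned}
$$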

The concavity conditions (17) are complicated. They are computed in the Appendix
for $\kappa_e = 0$:

$$
\begin{aligned}
& \lambda_x > 0 \\
& 4\lambda_x\lambda_y > (\lambda_x a_{xy} - \lambda_y a_{yx})^2 \\
& \lambda_z (6z - 2r_z)\left(4\lambda_x\lambda_y - (\lambda_x a_{xy} - \lambda_y a_{yx})^2\right) > 2\lambda_y (\lambda_x a_{xz} - \lambda_z a_{zx})^2
\end{aligned}
\tag{17}
$$

They imply that the marginal value of having moose and birch in the forest,
respectively, must be positive. The marginal value of having pine in the forest
must be positive if $z^*$ is relatively large ($z^* > r_z/3$), and negative if
$z^*$ is relatively small ($z^* < r_z/3$). In each case, $z^*$ must be large
enough or small enough to satisfy the third sufficiency condition. The value
$z = r_z/3$ is reached when the pine population is at one-third of its carrying
capacity. For smaller pine populations, the pines’ growth rate is convex. If the
pine population is larger, it is concave (recall Figure 1).

For $\kappa_e > 0$, that is, when recreational and environmental benefits enter
the social welfare function, the concavity conditions are very tedious to compute.
As in the case $\kappa_e = 0$, the concavity conditions are not always satisfied,
which implies that welfare comparisons between trajectories are necessary and
that the maximization problem may have one, several, or no optimal solutions,
depending on the initial state. As pointed out in section 3, comparing the value
of the Hamiltonian for different initial states can help sort out different
optimal trajectory candidates.

Table III. Parameter values in the main simulation

parameter   $r_y$   $r_z$   $a_{xy}$   $a_{xz}$   $a_{yx}$   $a_{yz}$   $a_{zx}$   $a_{zy}$   $\rho$   $\alpha_x$
value       1.6     2       1          0.5        0.1        0.2        0.05       0.7        0.02     1

parameter   $\alpha_y$   $\alpha_z$   $\kappa_e$   $p_x$   $p_y$   $p_z$   $\xi$   $\eta$   $\theta$
value       1            1            0.1          1       2       2       1/3     1/3      1/3



The optimal harvest is written, for all i:

$$
h_i^* = \frac{1}{2}\left(p_i - \frac{\lambda_i + \mu_i}{\alpha_i}\right) \quad \text{or} \quad h_i^* = 0
$$

When no species is depleted, by (12) the Lagrange multipliers must equal zero at
every point in time ($\mu_i = 0$). If prices are large enough relative to costs
and shadow prices ($\forall i$, $p_i > \lambda_i / \alpha_i$), all harvests are
positive. Assuming that this is the case, an optimal trajectory
$(x^*, y^*, z^*, \lambda^*)$ must solve SYS2.
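
The harvest rule follows from condition (10) with the quadratic profit function $\alpha_i h_i(p_i - h_i)$, and it is short enough to write down directly. In the sketch below the function name is hypothetical; the arguments correspond to $\lambda_i$, $p_i$, $\alpha_i$ and $\mu_i$ in the text.

```python
def optimal_harvest(shadow_price, p, alpha, mu=0.0):
    """Harvest maximizing alpha*h*(p - h) - (shadow_price + mu)*h over h >= 0."""
    return max(0.0, 0.5 * (p - (shadow_price + mu) / alpha))
```

With the benchmark values $p_z = 2$ and $\alpha_z = 1$ from Table III, a shadow price of 0.591 gives a harvest of about 0.70, while any shadow price above $p_i \alpha_i$ gives a zero harvest.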



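$$
\begin{aligned}
\dot{x} &= x - x^2 + a_{xy}xy + a_{xz}xz - \tfrac{1}{2}p_x + \frac{\lambda_x}{2\alpha_x} \\
\dot{y} &= r_y y - y^2 - a_{yx}xy - a_{yz}z - \tfrac{1}{2}p_y + \frac{\lambda_y}{2\alpha_y} \\
\dot{z} &= r_z z^2 - z^3 - a_{zx}xz - a_{zy}y - \tfrac{1}{2}p_z + \frac{\lambda_z}{2\alpha_z} \\
\dot{\lambda}_x &= -\kappa_e \xi x^{\xi - 1} y^{\eta} z^{\theta} - \lambda_x (1 - 2x + a_{xy}y + a_{xz}z - \rho) + \lambda_y a_{yx} y + \lambda_z a_{zx} z \\
\dot{\lambda}_y &= -\kappa_e \eta x^{\xi} y^{\eta - 1} z^{\theta} - \lambda_x a_{xy} x - \lambda_y (r_y - 2y - a_{yx}x - \rho) + \lambda_z a_{zy} \\
\dot{\lambda}_z &= -\kappa_e \theta x^{\xi} y^{\eta} z^{\theta - 1} - \lambda_x a_{xz} x + \lambda_y a_{yz} - \lambda_z (2 r_z z - 3z^2 - a_{zx}x - \rho)
\end{aligned}
\tag{SYS2}
$$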

4.2. SIMULATIONS 

Assigning numerical values to the parameters makes it possible to simulate this
differential equation system. Table III shows the values assigned to the
parameters in the main simulation.

Note that there are reasons to treat the cases $\kappa_e = 0$ and $\kappa_e > 0$
separately. First, the system’s dynamics are much easier to simulate when
$\kappa_e = 0$. Second, $\kappa_e = 0$ can also be interpreted as the case of a
private owner who only maximizes profits from forestry. Then $\alpha_i = c_i$ for
all $i \in \{x, y, z\}$: it is reasonable to believe that private owners do not
care about where profits come from, because they have no redistribution goals.

Even when parameters are replaced with numeric values, SYS2’s steady states
cannot be computed analytically. They must be evaluated numerically, which
implies that the number of steady states could be underestimated. Numerically
evaluated steady states usually depend on the quality of the initial guesses;
different initial guesses can lead to different steady states. For each set of
tested parameter values, the steady states were computed with MATHCAD using a
thousand different randomly chosen initial values between zero and ten for each
variable.¹⁶

Table IV. Steady states

Steady state   x       y       z       $\lambda_x$   $\lambda_y$   $\lambda_z$
S3*            1.767   0.503   1.138   -0.078        1.53          0.672
S3*            0.979   0.367   0.971   -0.71         1.555         0.668
S3*            0.234   0.624   1.207   0.067         1.294         0.591
S3*            0.537   0.231   0.036   0.235         1.407         2.32
S2*            2.14    2.837   0       -6.263        10.233        2
S2*            2.263   1.525   0       -0.186        2.46          2
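
The multi-start idea described above can be sketched on a one-dimensional stand-in for the steady-state conditions. Everything in the example is an illustrative assumption: the equation $2z^2 - z^3 - 0.35 = 0$ is a toy condition (pine alone with a constant loss), Newton's method replaces whatever solver MATHCAD uses, and the function names are hypothetical.

```python
import random


def newton(f, df, z0, tol=1e-12, max_iter=100):
    """Newton iteration from z0; returns a root or None if it fails to converge."""
    z = z0
    for _ in range(max_iter):
        slope = df(z)
        if slope == 0:
            return None
        step = f(z) / slope
        z -= step
        if abs(step) < tol:
            return z
    return None


def multistart_roots(f, df, n_starts=1000, low=0.0, high=10.0, seed=0):
    """Collect the distinct non-negative roots reached from random initial guesses."""
    rng = random.Random(seed)
    roots = []
    for _ in range(n_starts):
        z = newton(f, df, rng.uniform(low, high))
        if z is not None and z >= 0 and all(abs(z - r) > 1e-6 for r in roots):
            roots.append(z)
    return sorted(roots)


# Toy steady-state condition (hypothetical): pine alone with a constant loss of 0.35.
f = lambda z: 2.0 * z**2 - z**3 - 0.35
df = lambda z: 4.0 * z - 3.0 * z**2
roots = multistart_roots(f, df)
```

Most initial guesses in this toy case converge to the upper root, and only guesses in a fairly narrow range reach the lower one. That is why a small number of starting points can miss steady states.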

For the benchmark parameter values, with $\kappa_e = 0$, SYS2 has four feasible
interior steady states (the S3* states) and one steady state with a negative pine
biomass, which is not feasible. Three of them (two of the S3* states and the
infeasible state) are saddle points, and the other two are unstable, with only
two eigenvalues with negative real part. This is not surprising given the result
stated in proposition 3 and its corollary 4. The existence of a saddle steady
state with negative pine biomass indicates that for some initial states,
depletion of all pine may be optimal. This possibility was studied in Crepin
(2002, Ch. 2).¹⁷ The SYS2 system was simulated to check what happens when pine
disappears. This produced two steady states with no pine. Table IV shows the
variables’ values in all steady states. The steady states in bold are saddle
points.
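
A reported steady state can be checked by substituting it into the right-hand side of SYS2. The sketch below does this for $\kappa_e = 0$ with positive harvests ($\mu_i = 0$), using the benchmark values of Table III and the first row of Table IV. The function and dictionary names are hypothetical, and because the tabulated values are rounded to three decimals, the residuals are only expected to be small, not zero.

```python
# Benchmark values from Table III; alpha and p are ordered (x, y, z).
PARAMS = dict(r_y=1.6, r_z=2.0, a_xy=1.0, a_xz=0.5, a_yx=0.1, a_yz=0.2,
              a_zx=0.05, a_zy=0.7, rho=0.02, alpha=(1.0, 1.0, 1.0),
              p=(1.0, 2.0, 2.0))


def sys2_rhs(state, P=PARAMS):
    """Right-hand side of SYS2 for kappa_e = 0 and positive harvests (mu_i = 0)."""
    x, y, z, lx, ly, lz = state
    ax, ay, az = P["alpha"]
    px, py, pz = P["p"]
    rho = P["rho"]
    dx = x - x**2 + P["a_xy"]*x*y + P["a_xz"]*x*z - 0.5*px + lx/(2*ax)
    dy = P["r_y"]*y - y**2 - P["a_yx"]*x*y - P["a_yz"]*z - 0.5*py + ly/(2*ay)
    dz = P["r_z"]*z**2 - z**3 - P["a_zx"]*x*z - P["a_zy"]*y - 0.5*pz + lz/(2*az)
    dlx = (-lx*(1 - 2*x + P["a_xy"]*y + P["a_xz"]*z - rho)
           + ly*P["a_yx"]*y + lz*P["a_zx"]*z)
    dly = (-lx*P["a_xy"]*x - ly*(P["r_y"] - 2*y - P["a_yx"]*x - rho)
           + lz*P["a_zy"])
    dlz = (-lx*P["a_xz"]*x + ly*P["a_yz"]
           - lz*(2*P["r_z"]*z - 3*z**2 - P["a_zx"]*x - rho))
    return (dx, dy, dz, dlx, dly, dlz)


# First row of Table IV: (x, y, z, lambda_x, lambda_y, lambda_z).
first_row = (1.767, 0.503, 1.138, -0.078, 1.53, 0.672)
residuals = sys2_rhs(first_row)
```

All six residuals are of the order of the rounding error in the table, which is what one would expect at a steady state.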

So for the benchmark parameter values with $\kappa_e = 0$, the optimized forest
characterized by SYS2 has at least six feasible steady states, of which at least
two lead to pine depletion. It is easy to verify that only one of the S3* steady
states satisfies the sufficiency conditions (17).
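
The verification mentioned above can be sketched by evaluating the three inequalities in (17) at the interior steady states of Table IV. The code assumes $\kappa_e = 0$ and the benchmark parameters of Table III; the function name and the row ordering (which follows Table IV from top to bottom) are illustrative.

```python
def satisfies_conditions_17(z, lx, ly, lz, r_z=2.0,
                            a_xy=1.0, a_xz=0.5, a_yx=0.1, a_zx=0.05):
    """Evaluate the three concavity conditions (17) at a candidate steady state."""
    b = lx * a_xy - ly * a_yx
    c = lx * a_xz - lz * a_zx
    second_minor = 4.0 * lx * ly - b * b
    return (lx > 0
            and second_minor > 0
            and lz * (6.0 * z - 2.0 * r_z) * second_minor > 2.0 * ly * c * c)


# Interior rows of Table IV as (z, lambda_x, lambda_y, lambda_z).
interior_rows = [
    (1.138, -0.078, 1.530, 0.672),
    (0.971, -0.710, 1.555, 0.668),
    (1.207,  0.067, 1.294, 0.591),
    (0.036,  0.235, 1.407, 2.320),
]
passing = [i for i, row in enumerate(interior_rows)
           if satisfies_conditions_17(*row)]
```

Two rows fail because $\lambda_x$ is negative, and the last one fails the third condition because its pine stock lies below $r_z/3$ while $\lambda_z$ is positive.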

Simulations¹⁸ using the method developed in Beyn et al. (2001) showed
the existence of Skiba points. Examples of such points are (4.98, 0.5, 1.1),
(2, 1.33, 1.1), and (2, 0.5, 1.86). At those points, the welfare obtained by going
to one S3* steady state is about the same as the welfare obtained by going to
another. This shows that a steady state could sometimes be optimal even though
the sufficiency conditions do not hold there. Whether or not this is true depends
on initial conditions.

The set of all Skiba points forms the Skiba manifolds of this system. Locating
all Skiba points would be very tedious. According to Beyn et al. (2001), one could
use the initial state variables as continuation parameters to approximate the
Skiba manifold. Depending on whether or not pine extinction can ever be optimal,
there might be Skiba manifolds that separate the regions with optimal interior
states from the regions where a steady state with no pine is optimal.

The effects of variations in the birch growth rate on the species’ biomasses in
steady states were simulated for the case in which environmental benefits are not
accounted for ($\kappa_e = 0$). These simulations also showed bifurcations for
low birch growth rates. For $r_y$ below 0.45, there was only one feasible steady
state; for $r_y$ above 0.65, there were four; and in between, there were three
steady states. Variations in other parameters’ values also lead to bifurcations.

When a social planner accounted for environmental benefits, the results were 
modified. Some simulations showed that the number of steady states was generally 
higher for relatively low birch growth rates and smaller for higher birch growth 
rates. Comparing the steady states in both management regimes shows that the 
saddle points have higher levels of each population when environmental benefits 
are accounted for. This is not true for the unstable equilibria, which are probably
not optimal, although this has not yet been proved. For the benchmark parameter
values, there are only three steady states and no infeasible saddle point. This
suggests that the risk of pine becoming extinct is much lower when environmental
benefits are accounted for, which is not surprising.



5. Concluding Remarks

This paper shows that a boreal forest, managed to maximize values derived from 
it, may have several optimal interior steady states. Each optimal state is reachable 
by following an optimal saddle path. On this path, the marginal value derived from 
harvesting each species equals the marginal value of retaining the species in the 
ecosystem at each point in time. Which path is optimal to follow, and thus which
steady state is optimal, depends on the initial state of the system. The paper
also shows that for some initial states, the Skiba points, there is more than one
optimal trajectory, leading to different steady states.

The possibility of multiple steady states was already pointed out in Clark (1990) 
but this was assumed to occur only in specific problems of resource economics. 
This paper reinforces results from earlier literature on shallow lakes and coral reefs
and thereby shows that this kind of phenomenon is probably much more common
than previously thought. The non-concave pine growth is an essential assumption
that determines the results. This paper shows that:

— The results obtained in the shallow lake and coral reef literature cited in
the introduction are not due to specific properties of the sigmoid function used
in these articles. Rather, they depend on the non-linearity present in the equation
system. It does not seem to matter how this non-linearity is modeled.

— Non-linearity in pine growth affects growth and populations of moose and
birch and their harvest. So the results may be obtained even if there was
non-linearity in the dynamics of a species that is not even harvested, as long 
as the species affects the harvested species. This is a crucial point because
if this is a general finding and if many ecosystems host species with non-
linear dynamics that affect harvested species’ dynamics, this result could be 
generalized to most natural resources produced within ecosystems. In that 
case, traditional models of natural resource management derived from Gordon 
(1954), Scott (1955), Clark (1973) and Clark (1976) would almost always fail 
in the long-run. 

The existence of Skiba points means that at those points there are multiple 
accounting prices for the ecosystem resources. This may have important impli- 
cations for valuation studies: which price should be chosen as accounting price in 
a Skiba point? 

What do these results imply for a manager? The existence of multiple steady 
states reveals the exploited ecosystem’s dependency on history. What is optimal for 
one state of the world is not necessarily optimal for another. This has implications 
for the management of such systems. If the system had had a unique interior steady 
state, the forest owner could have reached the optimal saddle path by harvesting 
just enough for marginal costs to equal marginal benefits from harvest - including 
costs and benefits due to environmental changes and effects on other species. 
Here, marginal analysis is usually not enough to determine the optimal trajectory
at given initial points. One needs more information. Some of the steady states are
minimizing rather than maximizing solutions and can easily be ruled out. For the 
remaining maximizing steady state solutions, the optimal solution depends on the 
initial state. 

Furthermore, if an optimal trajectory passes close to a Skiba manifold, small
management mistakes could lead to crossing the manifold. If this happens, another
trajectory becomes optimal and future harvest opportunities can be completely
modified. Exogenous changes in the system can also lead to a Skiba manifold
crossing. Such exogenous changes include any changes that affect the variables, 
such as diseases, storms, and exogenous market shocks. 

A forest owner seeking to maximize the values obtained from the forest would
need to know all future harvesting benefits and costs with certainty to find out
which trajectory is optimal. Forest owners are usually not so knowledgeable. In
fact, costs and benefits become more and more uncertain the longer the time
horizon is. So one cannot determine the optimal management tactics once and for
all.

What should forest owners do? For example, they may want to revise their 
tactics at more or less regular intervals to see whether they need to adapt to changes 
or not. This is difficult because at each revision time they need to gather new 
information about all future costs and benefits, which is costly. This cost and the 
potential benefits obtained from adapting to changes give forest owners indications 
about how often they should recalculate the management tactics. The managers 
could also try to detect the thresholds between the basins of attraction of the 
different steady states. This means that they need to identify the Skiba manifolds 
or at least find a way to approximate their location. 

Notes

1. See Kuznetsov (1995) for a proof. 

2. Carrying capacity refers to the population size that can be supported in the area studied. It is the 
population size for which natural growth is equal to zero when there are no interactions from 
other species. 

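3. In Table I, k solves $a_{yz}a_{zy}^2 - r_y r_z a_{zy} Z + r_y a_{zy} Z^2 + r_z^2 Z^3 - 2 r_z Z^4 + Z^5 = 0$,
which has at most four real roots,
$\varphi = \frac{1}{2}\left(r_z - a_{zx}a_{xz} \pm \sqrt{(r_z - a_{zx}a_{xz})^2 - 4a_{zx}}\right)$,
$X(k) = \frac{a_{zy} + a_{xz}a_{zy}k + a_{xy}r_z k^2 - a_{xy}k^3}{a_{zx}a_{xy}k + a_{zy}}$, and
$Y(k) = \frac{-a_{zx}k + (r_z - a_{zx}a_{xz})k^2 - k^3}{a_{zy} + a_{xy}a_{zx}k}$.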

4. A fold bifurcation is a bifurcation that corresponds to the presence of an eigenvalue equal to 
zero. When this happens, two equilibria collide and disappear (Kuznetsov 1995). 

5. A Hopf bifurcation corresponds to the presence of complex conjugate eigenvalues with zero real 
parts. 

6. This is the case when, $\forall i \in \{x, y, z\}$, $\psi_i(\lambda_i)$ and its derivatives with regard to $\lambda_i$ are continuous.

7. Unfortunately, the usual existence theorems cannot be applied to guarantee the existence of an 
optimal solution because the concavity conditions have not necessarily been met. The existence 
of a steady state of the system (15) is also not guaranteed. 

8. Mathcad is software used to solve mathematical problems. Both numerical and analytical
methods can be used.

9. A copy of this program code can be requested from the author. 

10. A limit cycle is an isolated cycle of a continuous-time dynamic system. A cycle is a periodic orbit,
that is, a non-equilibrium orbit such that a trajectory starting at a point returns to the same point
after a length of time called the period. See Kuznetsov (1995) for a more detailed definition.

11. An attractor is roughly a subset of the phase space toward which trajectories starting from
nearby initial conditions are attracted. An attractor is said to be chaotic if two typical points on
the attractor that are separated by a small distance move apart exponentially fast as time
increases. Thus a small uncertainty in the initial state of the system rapidly leads to the inability
to forecast its future. See Grebogi et al. (1987) for further reading and references on the topic.

12. DYNAMICS is a program that explores the dynamics of differential and difference equation 
systems. 

13. MATLAB is a matrix-based interactive program for numerical computation and data visualization.

14. This relies on the following assumptions: recreational benefits from moose hunting are neglected
and profits are assumed to be independent of moose density; timber harvesting has no effect on
the timber’s market price; and there are no returns to scale.

15. To measure diversity, alternatives to the Cobb Douglas function can be found in Stirling and 
Wilsey (2001) or Norberg et al. (2001). 

16. Different ranges and different numbers of initial guesses were also tested for some parameter 
values and did not produce additional steady states. 

17. Note that the general saddle point property stated in proposition 3 does not remain when some 
species are extinct. 

18. The program codes can be requested from the author. 

Appendix



Let $\Delta_r(x, y, z)$ be the principal minor of order $r$ in the Hessian for $M(x, y, z, \lambda, t)$. The
maximized Hamiltonian $M$ is concave in $(x, y, z)$ if and only if, for all points $(x, y, z)$ and for
all $\Delta_r$, $(-1)^r \Delta_r(x, y, z) \geq 0$ for $r = 1, 2, 3$. The Hessian for $M(x, y, z, \lambda, t)$ is given by:

'2.x X XxClxy ^yt^yx ^x^xz ^z^zx 

XxClxy XyClyx 2Xy 0 

^xtixz ^z^zx 0 X^ilr^ 6z) 

From this, it follows that: 

i-iyAi(x,y,z) = 2Xx 

( 1 ) y ■> z) — 4-XxXy iXxClxy 

( 1) ■> y ■> — AXxXyX'^i^V'^ 6z) {XxClxy ^yClyx^ ^z{'^^Z 

~^2Xy(XxCtxz ^z^zx) 

This yields the concavity conditions: 

> 0 

4XxXy > {XxUxy XyClyx) 

^z{^Z 2r^){4-XxXy {Xxl^xy ^yClyx^ ) — 2Xy{XxClxz ^z^zx^ 

1. Introduction

In several recent publications, it has been shown that there is a wealth-like measure
that can serve as an index of intergenerational welfare. The index enables one (a) to 
check whether welfare will be sustained along an economic forecast, and (b) to 
conduct social cost-benefit analysis of policy reforms (e.g., investment projects). 
Excepting under special circumstances, however, the index in question is not 
wealth itself, but an adaptation of wealth. Interestingly, the results do not require 
the economy to be convex, nor do they require the assumption that the government 
optimizes on behalf of its citizens.¹

An economy’s wealth is the worth of its capital assets. As is widely recognised 
today, the list of assets should include not only manufactured capital, but also 
human capital (health, knowledge, and skills), and natural capital. Formally, an
economy’s wealth is a linear combination of its capital stocks, the weights awarded 
to the stocks being the latter’s accounting prices. 

The term accounting prices was used originally in the literature on economic
planning (Tinbergen 1954). The underlying presumption there was that govern- 
ments are intent on maximizing social welfare. Public investment criteria were 
subsequently developed for economies enjoying good governance (Little and 
Mirrlees 1968, 1974; Arrow and Kurz 1970). In its turn the now-extensive literature 
exploring various concepts of sustainable development has also been directed at 
societies where governments choose policies so as to maximize intergenerational 
welfare.²

Sustainability is different from optimality. To ask whether collective well-being 
is sustained along an economic forecast is to ask, roughly speaking, whether the 
economy’s production possibility set is growing. The concept of sustainability is 
useful for judging the performance of economies where the government, whether 
by design or incompetence, does not choose policies that maximise intergenera- 
tional welfare. One can argue, therefore, that the term “sustainable development” 
acquires particular bite when it is put to work in imperfect economies, that is, 
economies suffering from weak, or even bad, governance. Recently the theory of 
intertemporal welfare indices has been extended to such economies.³ The theory’s
reach therefore now extends to actual economies. The theory has also been put to
use in a valuable paper by Hamilton and Clemens (1999) for judging whether in the
recent past countries have invested sufficiently to expand their productive bases.⁴
Among the resources making up natural capital, only commercial forests, oil and 
minerals, and the atmosphere as a sink for carbon dioxide were included in the 
Hamilton-Clemens work. Not included were water resources, forests as agents of 
carbon sequestration, fisheries, air and water pollutants, soil, and biodiversity. Nor 
were discoveries of oil and mineral reserves taken into account. Moreover, there 
is a certain awkwardness in several of the steps Hamilton and Clemens took when 
estimating changes in the worth of an economy’s capital assets. Our aim in this 
paper is to clarify a number of issues that arise in putting the theory of welfare 
indices to practical use. It is our hope that the findings documented here will prove 
useful in future empirical work. 

We are interested in three related questions: (1) How should accounting prices 
be estimated? (2) How should we evaluate policy change in an imperfect economy? 
(3) How can we check whether intergenerational well-being will be sustained along 
a projected economic programme? 

For simplicity, we confine our analysis until Section 14 to a deterministic world. 
In Section 2 we rehearse the basic theory.⁵ We prove that the same set of accounting
prices should be used both for policy evaluation and for assessing whether or not 
intergenerational welfare along a given economic path will be sustained. We also 
show that a comprehensive measure of wealth, computed in terms of the accounting 
prices, can be used as an index for problems (2) and (3) above. These results do not 
require that the economy be convex, nor do they depend on the assumption that the 
government optimizes on behalf of its citizens subject to constraints. 
In Section 3 we use the Ramsey-Solow model of national saving in a convex
economy to illustrate the theory. In Section 4 we show that the theory can be put to 
use in non-convex economies by studying a particular class of ecosystems, namely, 
shallow lakes. The remainder of the paper is concerned with rules for estimating 
the accounting prices of specific environmental natural resources, transacted in a 
few well known economic institutions. 

In order to make our findings easily accessible for empirical work, we report
our findings as a catalogue of results. Rules for estimating accounting prices of
exhaustible natural resources under both free and restricted entry are derived in
Section 5. In Section 6 we show how expenditure toward the discovery of new
deposits ought to be incorporated in national accounts. Section 7 develops methods
for including forest depletion; and in Section 8 we show how the production of
human capital could be taken into account. In Section 9 we study the valuation of
global public goods.

If an economy were to face exogenous movements in certain variables, its
dynamics would not be autonomous in time. Non-autonomy in time introduces
additional problems for the construction of the required welfare index, in that
the wealth measure needs to be augmented. Exogenous growth in factor
productivities, for example, is a potential reason for non-autonomous dynamics.
In Section 10 we show that by suitably redefining variables, it is often possible
to transform a non-autonomous economic system into one that is autonomous. But
such helpful transformations are not available in many other cases. In Section 11 we
show that the required welfare index can nevertheless be constructed, by studying
a small country exporting an exhaustible natural resource at a price that is
time-dependent. The way defensive expenditure against pollution ought to be included
in national accounts is discussed in Section 12.

The theory developed up to and including Section 12 assumes that population
remains constant. In Section 13 we extend the theory to cover population change.⁶
In Section 14 we show how future uncertainty in commodity transformation
possibilities can be incorporated. Section 15 contains concluding remarks.

2. The Basic Model

2.1. PRELIMINARIES 

We assume that the economy is closed. Time is continuous and is denoted variously
by $\tau$ and $t$ ($\tau, t \geq 0$). The horizon is taken to be infinite. For simplicity of exposition,
we aggregate consumption into a single consumption good, $C$, and let $R$ denote a
vector of resource flows (e.g., rates of extraction of natural resources, expenditure
on education and health). Labour is supplied inelastically and is normalised to be
unity. Intergenerational welfare (henceforth, “social welfare”) at $t$ ($\geq 0$) is taken to
be of the Ramsey-Koopmans form,

$$W_t = \int_t^{\infty} U(C_\tau)\, e^{-\delta(\tau - t)}\, d\tau, \tag{1}$$

where the utility function, $U(C)$, is strictly concave and monotonically increasing,
and $\delta$ is the utility discount rate.

The state of the economy is represented by the vector K, where K is a compre- 
hensive list of capital assets. The economy under study faces not only technological 
and ecological constraints, but also a wide variety of institutional constraints. By 
the economy’s “institutions” we mean market structures, property rights, tax rates, 
non-market arrangements for credit, insurance, and common property resources, 
the character of various levels of government, and so forth. We do not assume 
that the government is necessarily bent on maximizing social welfare subject to 
constraints. It could be that the government is predatory, or is at best neglectful, 
and has objectives of its own that are not congruent with social welfare. Nor do we 
imagine institutions to be unchanging over time. What we do assume is that institu- 
tions coevolve with the state of the economy (K) in ways that are understood. It is 
no doubt a truism that social and political institutions influence the evolution of the 
state of an economy, but it has also been argued by political scientists (Lipset 1959) 
that the state of an economy (K) influences the evolution of social and political 
institutions. The theory we develop below accommodates this mutual influence. 

Let $\{C_\tau, R_\tau, K_\tau\}_t^{\infty}$ be an economic programme from $t$ to $\infty$. Given technological
possibilities, resource availabilities, and the dynamics of the ecological-economic
system, the decisions made by individual agents and consecutive governments
from $t$ onwards will determine $C_\tau$, $R_\tau$, and $K_\tau$, for $\tau \geq t$, as functions
of $K_t$, $\tau$, and $t$. Thus let $f(K_t, \tau, t)$, $g(K_t, \tau, t)$, and $h(K_t, \tau, t)$, respectively, be
consumption, the vector of resource flows, and the vector of capital assets at date
$\tau$ ($\geq t$) if $K_t$ is the vector of capital assets at $t$. Now write

$$(\xi_\tau)_t^{\infty} \equiv \{C_\tau, R_\tau, K_\tau\}_t^{\infty}, \quad \text{for } t \geq 0. \tag{2}$$

Let $\{t, K_t\}$ denote the set of possible $t$ and $K_t$ pairs, and $\{(\xi_\tau)_t^{\infty}\}$ the set of
economic programmes from $t$ to infinity.

Definition 1

A resource allocation mechanism, $\alpha$, is a (many-one) mapping

$$\alpha : \{t, K_t\} \rightarrow \{(\xi_\tau)_t^{\infty}\}. \tag{3}$$

It bears emphasis that we do not assume that $\alpha$ maps $\{t, K_t\}$ into optimum
economic programmes (starting at $t$), nor even that it maps $\{t, K_t\}$ into efficient
programmes (starting at $t$). The following analysis is valid even if $\alpha$ is riddled
with economic distortions and inequities. Nor do we assume, in defining $\alpha$, that
the economy’s institutions are fixed. If institutions and the state of the economy
were known to coevolve, that coevolution would be reflected in $\alpha$. Note too that
we do not assume commodity transformation possibility sets to be convex. This
is significant, because ecological processes involve transformation possibility sets
that are frequently non-convex, displaying, for example, threshold effects. The
reason we are able to accommodate non-convex production structures is that we 
are developing welfare economics in imperfect economies: we assume that the 
government (rather, some honest agency in government) seeks only to institute 
policy reform. For an optimizing government the matter would be different. As 
the Second Fundamental Theorem of Welfare Economics makes clear, production 
structures need to be convex if the optimum allocation is to be decentralized. 

Definition 2 

$\alpha$ is time-autonomous (henceforth autonomous) if for all $\tau \geq t$, $\xi_\tau$ is a function
solely of $K_t$ and $(\tau - t)$.

Notice that if $\alpha$ is autonomous, economic variables at date $\tau$ ($\geq t$) are functions
of $K_t$ and $(\tau - t)$ only. $\alpha$ would be non-autonomous if, for example,
knowledge or the terms of trade (for a trading economy) were to change exogenously
over time. In certain cases exogenous changes in population size would
mean that $\alpha$ is not autonomous. However, by suitably redefining state variables,
non-autonomous resource allocation mechanisms can sometimes be mapped into
autonomous mechanisms (Sections 10 and 13).

Definition 3 

$\alpha$ is time-consistent if

$$h(K_{\tau'}, \tau'', \tau') = h(K_t, \tau'', t), \quad \text{for all } \tau'', \tau', \text{ and } t, \tag{4}$$

where $K_{\tau'} = h(K_t, \tau', t)$.

Time-consistency implies a weak form of rationality. An autonomous resource
allocation mechanism, however, has little to do with rationality; it has to do
with the influence of external factors (e.g., whether trade prices are changing
autonomously). In what follows, we assume that $\alpha$ is time-consistent.
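
As a minimal sketch of condition (4), consider a hypothetical one-asset forecast in which the stock grows at a constant rate `g`, so that $h(K_t, \tau, t) = K_t e^{g(\tau - t)}$. The function name `h` follows the text; the growth rule and all numbers are illustrative assumptions and are not part of the model. Restarting the forecast at an intermediate date, from the stock that was forecast for that date, should reproduce the original forecast.

```python
import math

def h(K_t, tau, t, g=0.03):
    """Forecast capital at date tau >= t, given stock K_t at date t (hypothetical rule)."""
    return K_t * math.exp(g * (tau - t))

K_0 = 100.0                          # illustrative initial stock
t, tau1, tau2 = 0.0, 5.0, 12.0       # planning date and two later dates

K_tau1 = h(K_0, tau1, t)             # stock forecast for the intermediate date tau1
restarted = h(K_tau1, tau2, tau1)    # forecast for tau2, restarted at tau1
direct = h(K_0, tau2, t)             # forecast for tau2, made at t

# Condition (4): the two forecasts for date tau2 coincide.
print(math.isclose(restarted, direct))
```

A forecast rule whose growth rate depended on the planning date itself would generally fail this check.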

Definition 4 

The value function reflects social welfare (equation (1)) as a function of initial 
capital stocks and the resource allocation mechanism. We write this as 



$$W_t = V(K_t, \alpha, t). \tag{5}$$

In what follows, we will often write $V(K_t, \alpha, t) = V_t$.

Let $K_i$ be the $i$th capital stock. We assume that $V$ is differentiable in $K$.⁷

Definition 5

The accounting price, $p_{it}$, of the $i$th capital stock is defined as

$$p_{it} = \partial V(K_t, \alpha, t)/\partial K_{it} \equiv \partial V_t/\partial K_{it}. \tag{6}$$

Note that accounting prices are defined in terms of hypothetical perturbations to 
an economic forecast. Specifically, the accounting price of a capital asset is the 
present discounted value of the perturbations to $U$ that would arise from a marginal
increase in the quantity of the asset. Given the resource allocation mechanism,
accounting prices at $t$ are functions of $K_t$, and possibly of $t$ as well (i.e., $p_{it} = p_i(K_t, t)$).
The prices depend also on the extent to which various capital assets are
substitutable for one another. It should be noted that accounting prices of private
“goods” can be negative if property rights are dysfunctional, such as those that lead
to the tragedy of the commons. Note too that if $\alpha$ is autonomous, accounting prices
are not explicit functions of time, and so, $p_{it} = p_i(K_t)$.
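
The idea that an accounting price is defined by a hypothetical perturbation to a forecast can be sketched numerically. The code below is only an illustration under stated assumptions: `social_welfare`, `accounting_prices`, `toy_mechanism` and `toy_utility` are hypothetical names, the forecast is simulated with a forward Euler step over a truncated horizon, and the one-asset rule and parameter values are invented for the example. The mechanism is not assumed to be optimal; it only describes what is forecast to happen.

```python
import math

def social_welfare(K0, mechanism, utility, delta, horizon=300.0, dt=0.02):
    """Approximate V = integral of U(C) e^{-delta*tau} along the forecast from K0.

    `mechanism(K)` returns (consumption, list of dK_i/dt) for the current stocks K.
    """
    K = list(K0)
    V = 0.0
    for n in range(int(horizon / dt)):
        C, dK = mechanism(K)
        V += utility(C) * math.exp(-delta * n * dt) * dt
        K = [k + dk * dt for k, dk in zip(K, dK)]   # forward Euler step
    return V

def accounting_prices(K0, mechanism, utility, delta, eps=1e-3):
    """Central-difference estimate of p_i = dV/dK_i, as in equation (6)."""
    prices = []
    for i in range(len(K0)):
        up, down = list(K0), list(K0)
        up[i] += eps
        down[i] -= eps
        dV = (social_welfare(up, mechanism, utility, delta)
              - social_welfare(down, mechanism, utility, delta))
        prices.append(dV / (2 * eps))
    return prices

# Hypothetical autonomous mechanism: one asset, fixed consumption share, fixed growth.
def toy_mechanism(K):
    consumption = 0.12 * K[0]
    return consumption, [0.03 * K[0]]

def toy_utility(C):
    return -1.0 / C            # strictly concave and increasing for C > 0

p = accounting_prices([10.0], toy_mechanism, toy_utility, delta=0.03)
print(p)
```

For this toy rule the value function has the closed form $V(K) = -1/(0.12 \times 0.06\, K)$, so the estimate can be compared with $1/(0.0072\, K^2)$ at $K = 10$.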

2.2. MARGINAL RATES OF SUBSTITUTION VS MARKET OBSERVABLES 

Using (1) and (6), it can be shown that, if $\alpha$ is autonomous, $p_{it}$ satisfies the
dynamical equation,

$$dp_{it}/dt = \delta p_{it} - U'(C_t)\,\partial C_t/\partial K_{it} - \sum_j p_{jt}\,\partial(dK_{jt}/dt)/\partial K_{it}. \tag{7}$$

(7) reduces to Pontryagin equations for co-state variables in the case where $\alpha$ is
an optimum resource allocation mechanism. In any event, we show below that, in
order to study the evolution of accounting prices under simple resource allocation
mechanisms, it is often easier to work directly with (6).

From (6) it also follows that accounting price ratios ($p_{it}/p_{jt}$, $p_{i\tau}/p_{it}$, and consumption
discount rates (see below)) are defined as marginal social rates of substitution
between goods. In an economy where the government maximizes social welfare, 
marginal rates of substitution among goods and services equal their corresponding 
marginal rates of transformation. As the latter are observable in market economies 
(e.g., border prices for traded goods in an open economy), accounting prices are 
frequently defined in terms of marginal rates of transformation among goods and 
services. However, marginal rates of substitution in imperfect economies do not 
necessarily equal the corresponding marginal rates of transformation. A distinction 
therefore needs to be made between the ingredients of social welfare and “market 
observables”. Using market observables to infer social welfare can be misleading 
in imperfect economies. That we may have to be explicit about welfare parameters 
(e.g., $\delta$ and the elasticity of $U'(C)$) in order to estimate marginal rates of substitution
in imperfect economies is not an argument for pretending that the economies in 
question are not imperfect after all. In principle it could be hugely misleading to use 
the theory of optimum control to justify an exclusive interest in market observables. 

2.3. GENUINE INVESTMENT AS A MEASURE OF SUSTAINABLE DEVELOPMENT 

IUCN (1980) and World Commission (1987) introduced the concept of sustainable
development. The latter publication defined sustainable development to be “. . .
development that meets the needs of the present without compromising the ability 
of future generations to meet their own needs” (World Commission 1987, p. 43). 
Several formulations are consistent with this phrase. But the underlying idea is
straightforward enough: we seek a measure that would enable us to judge whether
an economy’s production possibility set is, in a loose sense, growing. Our analysis
rests on an interpretation of sustainability that is based on the maintenance
of social welfare, rather than on the maintenance of the economy’s productive
base. We then show that the requirement that economic development be sustainable
implies, and is implied by, the requirement that the economy’s productive
base be maintained (Theorems 1-3). These results give intellectual support for the
definition of sustainability we adopt here.⁸

Definition 6 

The economic programme $\{C_t, R_t, K_t\}_0^{\infty}$ corresponds to a sustainable development
path at $t$ if $dV_t/dt \geq 0$.⁹

Notice that the above criterion does not attempt to identify a unique economic 
programme. In principle any number of technologically and ecologically feasible 
economic programmes could satisfy the criterion. On the other hand, if substitution 
possibilities among capital assets are severely limited and technological advances 
are unlikely to occur, it could be that there is no sustainable economic programme 
open to an economy. Furthermore, even if the government were bent on optimising 
social welfare, the chosen programme would not correspond to a sustainable path 
if the utility discount rate, $\delta$, were too high. It could also be that along an optimum
path social welfare declines for a period and then increases thereafter, in which
case the optimum programme does not correspond to a sustainable path locally,
but does so in the long run.¹⁰

Optimality and sustainability are thus different notions. The concept of sustainability
helps us to better understand the character of economic programmes, and is
particularly useful for judging the performance of imperfect economies.

We may now state 

Theorem 1 

$$dV_t/dt = \sum_i p_{it}\, dK_{it}/dt + \partial V_t/\partial t. \tag{8}$$

The proof follows directly from equations (5) and (6).

Definition 6

The accounting value of the rate of change in the stocks of capital assets is called
genuine investment.

If $\alpha$ is autonomous, then $\partial V_t/\partial t = 0$, and so, from equation (8) we have,

Theorem 2

If $\alpha$ is autonomous, then

$$dV_t/dt = \sum_i p_{it}\, dK_{it}/dt.{}^{11} \tag{9}$$



Equation (9) states that at each date the rate of change in social welfare equals 
genuine investment. Theorem 2 gives a local measure of sustainability. Integrating 
(9) yields a non-local measure: 



Theorem 3 

If $\alpha$ is autonomous, for all $T \geq 0$,

$$V_T - V_0 = \sum_i \left[p_{iT} K_{iT} - p_{i0} K_{i0}\right] - \int_0^T \left[\sum_i (dp_{i\tau}/d\tau)\, K_{i\tau}\right] d\tau. \tag{10}$$

Equation (10) shows that in assessing whether or not social welfare has increased 
between two dates, the “capital gains” on the assets that have accrued over the 
interval should be deducted from the difference in wealth between the dates. 
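
Equation (10) can be illustrated with a hypothetical autonomous economy that is not taken from the text: one asset, $U(C) = \ln C$, consumption a fixed share `c` of the stock, and the stock growing at rate `g`. Under these assumptions $V(K) = \ln(cK)/\delta + g/\delta^2$ and $p = 1/(\delta K)$, so wealth at accounting prices, $pK$, never changes even though welfare rises. The whole of the welfare change then shows up in the capital-gains term. All names and numbers below are illustrative.

```python
import math

delta, g, c = 0.03, 0.02, 0.10     # hypothetical discount rate, growth rate, consumption share
K0, T = 50.0, 10.0

def K(tau):
    return K0 * math.exp(g * tau)

def V(k):
    """Closed-form welfare for U = ln C, C = c*K, dK/dt = g*K (autonomous)."""
    return math.log(c * k) / delta + g / delta ** 2

def p(k):
    """Accounting price dV/dK."""
    return 1.0 / (delta * k)

# Capital-gains term in (10): integral over [0, T] of (dp/dtau) * K, by the midpoint rule.
n = 10_000
step = T / n
gains = 0.0
for j in range(n):
    tau = (j + 0.5) * step
    dp_dtau = (p(K(tau + 1e-6)) - p(K(tau - 1e-6))) / 2e-6
    gains += dp_dtau * K(tau) * step

wealth_change = p(K(T)) * K(T) - p(K0) * K0
lhs = V(K(T)) - V(K0)              # change in social welfare
rhs = wealth_change - gains        # right-hand side of (10)
print(wealth_change, gains, lhs, rhs)
```

In this example the accounting price falls as the stock grows, so the capital gains are negative and deducting them raises the measured welfare change above the (zero) change in wealth.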

Each of Theorems 1, 2 and 3 is an equivalence result. None says whether $\alpha$
gives rise to an economic programme along which social welfare is sustained. For
example, it can be that an economy is incapable of achieving a sustainable development
path, owing to scarcity of resources, limited substitution possibilities among
capital assets, or whatever. Or it can be that although the economy is in principle
capable of achieving a sustainable development path, social welfare is unsustainable
along the path that has been forecast because of bad government policies.
Or it can be that $\alpha$ is optimal, but that because the chosen utility discount rate is
large, social welfare is not sustained along the optimum economic programme. Or 
it can be that along an optimum path social welfare declines for a period and then 
increases thereafter. 



2.4. WHAT ELSE DOES GENUINE INVESTMENT MEASURE? 

Genuine investment is related to changes in future consumption brought about by
it. Imagine that the capital base at $t$ is not $K_t$ but $K_t + \Delta K_t$, where as before, $\Delta$ is
an operator signifying a small difference. In the obvious notation,

$$V(K_t + \Delta K_t, \alpha, t) - V(K_t, \alpha, t) = \int_t^{\infty} U'(C_\tau)\, \Delta C_\tau\, e^{-\delta(\tau - t)}\, d\tau. \tag{11}$$

Now suppose that at $t$ there is a small change in $\alpha$, but only for a brief moment,
$\Delta t$, after which the resource allocation mechanism reverts back to $\alpha$. We write
the increment in the capital base at $t + \Delta t$ consequent upon the brief increase in
genuine investment as $\Delta K_t$. So $\Delta K_t$ is the consequence of an increase in genuine
investment at $t$ and $(K_{t+\Delta t} + \Delta K_t)$ is the resulting capital base at $t + \Delta t$. Let $\Delta t$
tend to zero. From equation (11) we obtain

Theorem 4

Genuine investment measures the present discounted value of the changes to
consumption services brought about by it.¹²

2.5. PROJECT EVALUATION CRITERIA 

Theorem 4 provides a criterion for social cost-benefit analysis of policy reforms. 
Imagine that even though the government does not optimize, it can bring about 
small changes to the economy by altering the existing resource allocation mecha- 
nism in minor ways. The perturbation in question could be small adjustments to 
the prevailing structure of taxes for a short while, or it could be minor alterations 
to the existing set of property rights for a brief period, or it could be a small public 
investment project. Call any such perturbation a “policy reform”. 

Consider as an example an investment project. It can be viewed as a perturbation
to the resource allocation mechanism $\alpha$ for a brief period (the lifetime of the
project), after which the mechanism reverts back to its earlier form. We consider 
projects that are small relative to the size of the economy. How should they be 
evaluated? 

For simplicity of exposition, we suppose there is a single manufactured capital 
good (K) and a single extractive natural resource (S). The rate of extraction is 
denoted by R. Let the project’s lifetime be the period [0, T]. Denote the project’s 
output and inputs at $t$ by the vector $(\Delta Y_t, \Delta L_t, \Delta K_t, \Delta R_t)$. We imagine that if the
project is accepted, the project manager would rent $\Delta K_t$ at $t$ for the period $t$ to $t + \Delta t$.¹³

The project’s acceptance would perturb consumption under $\alpha$. Let the perturbation
at $t$ ($\geq 0$) be $\Delta C_t$. It would affect $U_t$ by the amount $U'(C_t)\Delta C_t$. However,
because the perturbation includes all “general equilibrium effects”, it would be
tiresome if the project evaluator were required to estimate $\Delta C_t$ for every project
that came up for consideration. Accounting prices are useful because they enable
project evaluators to estimate $\Delta C_t$ indirectly, which means that they do not
have to go beyond project data in order to evaluate projects. Now, it is most
unlikely that consumption and investment have the same accounting price in an
imperfect economy. So we divide $\Delta Y_t$ into two parts: changes in consumption
and in investment in manufactured capital. Denote them as $\Delta C_t$ and $\Delta(dK/dt)$,
respectively.

$U$ is the unit of account. Let $w_t$ denote the accounting wage rate. Next, let $q_t$
be the accounting price of the extractive resource input of the project and $\lambda_t$ the
social cost of borrowing capital (i.e., $\lambda_t = \delta - [dp_t/dt]/p_t$).¹⁴

From the definition of accounting prices, it follows that: 

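$$\int_0^{\infty} U'(C_\tau)\, \Delta C_\tau\, e^{-\delta\tau}\, d\tau = \int_0^{T} \left(U'(C_\tau)\Delta C_\tau + p_\tau \Delta(dK_\tau/d\tau) - w_\tau \Delta L_\tau - \lambda_\tau p_\tau \Delta K_\tau - q_\tau \Delta R_\tau\right) e^{-\delta\tau}\, d\tau. \tag{12}$$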





KENNETH J. ARROW ET AL. 



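But the RHS of (12) is the present discounted value of social profits from the project
(in utility numeraire). Moreover, $\int_0^{\infty} U'(C_\tau)\Delta C_\tau e^{-\delta\tau} d\tau = \Delta V_0$, the latter being
the change in social welfare if the project were accepted. We may therefore write
(12) as,

$$\Delta V_0 = \int_0^{T} \left(U'(C_\tau)\Delta C_\tau + p_\tau \Delta(dK_\tau/d\tau) - w_\tau \Delta L_\tau - \lambda_\tau p_\tau \Delta K_\tau - q_\tau \Delta R_\tau\right) e^{-\delta\tau}\, d\tau. \tag{13}$$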

Equation (13) leads to the well-known criterion for project evaluation: 

Theorem 5 

A project should be accepted if and only if the present discounted value of its social 
profits is positive. 
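
The criterion can be sketched as a short calculation. The function below approximates the integral in (13) by the midpoint rule; `social_profit_pdv`, the dictionary keys, and `example_project` are hypothetical names, and every number is invented for illustration. In practice the accounting prices would have to be estimated for the economy in question, and they would generally vary over the project's lifetime.

```python
import math

def social_profit_pdv(project, delta, T, steps=10_000):
    """Midpoint-rule approximation of the integral in (13), in utility numeraire.

    `project(tau)` returns a dict with the marginal utility of consumption and the
    accounting prices at tau, together with the project's outputs and inputs.
    """
    dt = T / steps
    total = 0.0
    for n in range(steps):
        tau = (n + 0.5) * dt
        x = project(tau)
        profit = (x["U_prime"] * x["dC"]           # consumption output
                  + x["p"] * x["dKdot"]            # investment output
                  - x["w"] * x["dL"]               # labour input
                  - x["lam"] * x["p"] * x["dK"]    # rental of capital
                  - x["q"] * x["dR"])              # resource input
        total += profit * math.exp(-delta * tau) * dt
    return total

# Hypothetical project with constant flows and constant accounting prices.
def example_project(tau):
    return {"U_prime": 1.0, "dC": 5.0, "p": 1.2, "dKdot": 1.0,
            "w": 0.8, "dL": 3.0, "lam": 0.05, "dK": 20.0,
            "q": 0.5, "dR": 2.0}

pdv = social_profit_pdv(example_project, delta=0.03, T=15.0)
accept = pdv > 0               # the acceptance rule of Theorem 5
print(pdv, accept)
```

With constant flows the integral has the closed form $\pi(1 - e^{-\delta T})/\delta$, where $\pi$ is the social profit per unit of time, which gives a simple check on the numerical result.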

2.6. NUMERAIRE 

So far we have taken utility to be the unit of account. In applied welfare economics, 
however, it has been found useful to express benefits and costs in terms of current 
consumption. It will pay to review the way the theory being developed here can be 
recast in consumption numeraire. For simplicity of exposition, assume that there 
is a single commodity, that is, an all-purpose durable good that can be consumed 
or reinvested for its own accumulation. Assume too that the elasticity of marginal 
utility is a constant, $\eta$. Define $\bar{p}_t$ to be the accounting price of the asset at $t$ in terms
of consumption at $t$; that is,

$$\bar{p}_t = p_t/U'(C_t). \tag{14}$$

It follows from (14) that,

$$(d\bar{p}_t/dt)/\bar{p}_t = (dp_t/dt)/p_t + \eta(dC_t/dt)/C_t. \tag{15}$$

Let $\rho_t$ be the social rate of discount in consumption numeraire. $\rho_t$ is sometimes
referred to as the consumption rate of interest (Little and Mirrlees 1974). From (1),

$$\rho_t = \delta + \eta(dC_t/dt)/C_t.{}^{16} \tag{16}$$

Using (16) in (15) we obtain the relationship between the asset’s prices in the two
units of account:

$$(d\bar{p}_t/dt)/\bar{p}_t = (dp_t/dt)/p_t + \rho_t - \delta.{}^{17} \tag{17}$$



2.7. INTRAGENERATIONAL DISTRIBUTION 

The distribution of well-being within a generation has been ignored so far. Theoretically
it is not difficult to include this. If there are $N$ people in each generation
and person $j$ consumes $C_j$, her welfare would be $U(C_j)$.¹⁸ A simple way to express
intragenerational welfare would be to “concavify” $U$. Let $G$ be a strictly concave,
increasing function of real numbers. We may then express intragenerational welfare
as $\sum_j G(U(C_j))$. Some people would be well-off, others badly-off. The formulation
ensures that at the margin, the well-being of someone who is badly off is
awarded greater weight than that of someone well-off.
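
A small numerical sketch shows what concavifying does at the margin. The functional forms are assumptions chosen for convenience, not forms used in the text: $U(C) = \ln C$ and $G(u) = -e^{-u}$, so that $G(U(C)) = -1/C$. The helper `marginal_weight` is a hypothetical name for a finite-difference derivative.

```python
import math

def U(C):
    return math.log(C)          # individual welfare (hypothetical form)

def G(u):
    return -math.exp(-u)        # strictly concave, increasing transform (hypothetical)

def marginal_weight(f, C, h=1e-6):
    """Central-difference derivative of f with respect to consumption."""
    return (f(C + h) - f(C - h)) / (2 * h)

C_poor, C_rich = 1.0, 10.0      # illustrative consumption levels

# Marginal social value of consumption, poor person relative to rich person.
ratio_U = marginal_weight(U, C_poor) / marginal_weight(U, C_rich)
ratio_GU = (marginal_weight(lambda C: G(U(C)), C_poor)
            / marginal_weight(lambda C: G(U(C)), C_rich))
print(ratio_U, ratio_GU)
```

Under these assumptions the relative weight on the poorer person's marginal consumption rises from about 10 with $U$ alone to about 100 with $G(U)$.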

The social worth of consumption services (C) depends on who gets what. To 
accommodate this idea, we have to enlarge the set of commodities so as to distin- 
guish, at the margin, a good consumed or supplied by one person from that same 
good consumed or supplied by another. Thus, a piece of clothing worn by a poor 
person should be regarded as a different commodity from that same type of clothing 
worn by someone who is rich. With this re-interpretation of goods and services, the 
results we have obtained continue to hold. 

Relatedly, we should note that the connection between rural poverty in the 
world’s poorest regions and the state of the local ecosystems is a close one. When 
wetlands, inland and coastal fisheries, woodlands, forests, ponds and lakes, and 
grazing fields are damaged (owing, say, to agricultural encroachment, or urban 
extensions, or the construction of large dams, or organizational failure at the village 
level), traditional dwellers suffer. For them - and they are among the poorest in 
society - there is frequently no alternative source of livelihood. In contrast, for
rich eco-tourists or importers of primary products, there is something else, often 
somewhere else, which means that there are alternatives. Whether or not there are 
substitutes for a particular resource is therefore not only a technological matter, nor 
a mere matter of consumer taste: among poor people location can matter too. The 
poorest of the poor experience non-convexities in a way the rich do not. Even the 
range between a need and a luxury is context-ridden. Macroeconomic reasoning 
glosses over the heterogeneity of Earth’s resources and the diverse uses to which 
they are put - by people residing at the site and by those elsewhere. 



3. Illustration, 1: A Convex Production Economy 

It will prove useful to illustrate the theory by means of a simple example, based on
Ramsey (1928) and Solow (1956). As in Section 2.6, imagine that there is an all-
purpose durable good, whose stock at $t$ is $K_t$ ($\geq 0$). The good can be consumed or
reinvested for its own accumulation. There are no other assets. Write output (GNP)
as $Y$. Technology is linear. So $Y = \mu K$, where $\mu > 0$. $\mu$ is the output-wealth ratio.
GNP at $t$ is $Y_t = \mu K_t$.

Imagine that a constant proportion of GNP is saved at each moment. There is 
no presumption though that the saving rate is optimum; rather, it is a behavioural 
characteristic of consumers, reflecting their response to an imperfect credit market. 
Other than this imperfection, the economy is assumed to function well. At each 
moment expectations are fulfilled and all markets other than the credit market clear. 
This defines the resource allocation mechanism, $\alpha$. Clearly, $\alpha$ is autonomous in
time. We now characterise $\alpha$ explicitly.





Let the saving ratio be $s$ ($0 < s < 1$). Write aggregate consumption as $C_t$.
Therefore,

$$C_t = (1 - s)Y_t = (1 - s)\mu K_t. \tag{18}$$

Capital is assumed to depreciate at a constant rate $\gamma$ ($> 0$). Genuine investment is
therefore,

$$dK_t/dt = (s\mu - \gamma)K_t. \tag{19}$$

$K_0$ is the initial capital stock. The economy grows if $s\mu > \gamma$, and shrinks if
$s\mu < \gamma$. To obtain a feel for orders of magnitude, suppose $\gamma = 0.05$ and $\mu = 0.25$. The
economy grows if $s > 0.2$, and shrinks if $s < 0.2$.

Integrating (19), we obtain,

$$K_\tau = K_t\, e^{(s\mu - \gamma)(\tau - t)}, \quad \tau \geq t \geq 0, \tag{20}$$

from which it follows that,

$$C_\tau = (1 - s)\mu K_\tau = (1 - s)\mu K_t\, e^{(s\mu - \gamma)(\tau - t)}, \quad \tau \geq t \geq 0. \tag{21}$$

If the capital stock was chosen as numeraire, wealth would be $K_t$, and NNP
would be $(\mu - \gamma)K_t$. Each of wealth, GNP, NNP, consumption and genuine investment
expands at the exponential rate $(s\mu - \gamma)$ if $s\mu > \gamma$; they all contract at the
exponential rate $(\gamma - s\mu)$ if $s\mu < \gamma$. We have introduced capital depreciation into
the example so as to provide a whiff (albeit an artificial whiff) of a key idea, that
even if consumption is less than GNP, wealth declines when genuine investment is
negative. Wealth declines when consumption exceeds NNP.
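
The accounting identities of this example can be tabulated directly. The sketch below uses the values $\gamma = 0.05$ and $\mu = 0.25$ from the text; the function name `national_accounts`, the stock $K = 100$, and the two saving rates are illustrative assumptions chosen to lie on either side of the threshold $\gamma/\mu = 0.2$.

```python
def national_accounts(K, s, mu, gamma):
    """GNP, NNP, consumption and genuine investment, with capital as numeraire."""
    gnp = mu * K
    nnp = (mu - gamma) * K
    consumption = (1 - s) * mu * K             # equation (18)
    genuine_investment = (s * mu - gamma) * K  # equation (19)
    return gnp, nnp, consumption, genuine_investment

mu, gamma, K = 0.25, 0.05, 100.0   # K = 100 is an arbitrary illustrative stock

for s in (0.10, 0.30):             # one saving rate on each side of gamma/mu = 0.2
    gnp, nnp, c, inv = national_accounts(K, s, mu, gamma)
    # Columns: saving rate, C below GNP?, C above NNP?, genuine investment negative?
    print(s, c < gnp, c > nnp, inv < 0)
```

In both cases consumption is below GNP, but only at the lower saving rate does consumption exceed NNP, and that is the case in which genuine investment is negative and wealth declines.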

Current utility is $U(C_t)$. Consider the form

$$U(C) = -C^{-(\eta - 1)}, \quad \text{where } \eta > 1. \qquad (22)$$

$\eta$ is the elasticity of marginal utility and $\delta$ is the social rate of discount if utility
is numeraire. Let $\rho_t$ be the social rate of discount if consumption is the unit of
account. It follows that

$$\rho_t = \delta + \eta(dC_t/dt)/C_t = \delta + \eta(s\mu - \gamma). \qquad (23)$$

The sign of $\rho_t$ depends upon the resource allocation mechanism $\alpha$. In particular, $\rho_t$
can be negative. To see why, suppose the unit of time is a year, $\delta = 0.03$, $\gamma = 0.04$,
$s = 0.10$, $\eta = 2$, and $\mu = 0.20$. Then $\eta(dC_t/dt)/C_t = -0.04$ per year, and (23) says
that $\rho_t = -0.01$ per year.
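The orders of magnitude quoted for (19)-(23) can be checked with a few lines of code. The following is a minimal sketch, not part of the original analysis; the function names are illustrative, and the symbols follow the text (s, mu, gamma, delta, eta).

```python
import math


def growth_rate(s, mu, gamma):
    """Exponential growth rate of capital implied by (19): s*mu - gamma."""
    return s * mu - gamma


def capital_at(K_t, s, mu, gamma, elapsed):
    """Capital stock after `elapsed` units of time, equation (20)."""
    return K_t * math.exp(growth_rate(s, mu, gamma) * elapsed)


def consumption_at(K_t, s, mu, gamma, elapsed):
    """Consumption after `elapsed` units of time, equation (21)."""
    return (1.0 - s) * mu * capital_at(K_t, s, mu, gamma, elapsed)


def consumption_discount_rate(s, mu, gamma, delta, eta):
    """Social discount rate in consumption numeraire, equation (23)."""
    return delta + eta * growth_rate(s, mu, gamma)


if __name__ == "__main__":
    # Threshold example from the text: gamma = 0.05, mu = 0.25, so s = 0.2 gives zero growth.
    print(growth_rate(0.2, 0.25, 0.05))
    # Negative discount rate example: delta = 0.03, gamma = 0.04, s = 0.10, eta = 2, mu = 0.20.
    print(consumption_discount_rate(0.10, 0.20, 0.04, 0.03, 2.0))
```

With the second parameter set the economy shrinks at 2 percent a year, which is what drives the consumption discount rate below zero.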

Social welfare at $t$ is,

$$V_t = \int_t^\infty U(C_\tau)e^{-\delta(\tau - t)}\,d\tau. \qquad (24)$$

Using (21) and (22) in (24), we have:

$$V_t = -[(1 - s)\mu K_t]^{-(\eta - 1)}\int_t^\infty e^{-[(\eta - 1)(s\mu - \gamma) + \delta](\tau - t)}\,d\tau,$$

or, assuming that $[(\eta - 1)(s\mu - \gamma) + \delta] > 0$,

$$V_t = -[(1 - s)\mu K_t]^{-(\eta - 1)}/[(\eta - 1)(s\mu - \gamma) + \delta]. \qquad (25)$$



$V$ is differentiable in $K$ everywhere. Moreover, $\partial V_t/\partial t = 0$. Equations (20) and (25)
confirm Theorem 

We turn now to accounting prices. 

(i) UTILITY NUMERAIRE 

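Begin by taking utility to be numeraire. Let $p_t$ be the accounting price of capital.
Now

$$p_t = \int_t^\infty U'(C_\tau)[\partial C_\tau/\partial K_t]e^{-\delta(\tau - t)}\,d\tau. \qquad (26)$$

Using (25) in (26) we have,

$$p_t = (\eta - 1)[(1 - s)\mu]^{-(\eta - 1)}K_t^{-\eta}/[(\eta - 1)(s\mu - \gamma) + \delta]. \qquad (27)$$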



Using equations (20), (21), (25), and (27) it is simple to check that $p_t \neq U'(C_t)$,
except when $s = (\mu + (\eta - 1)\gamma - \delta)/\mu\eta$. Let $s^*$ be the optimum saving rate. From
equation (25) we have,

$$s^* = (\mu + (\eta - 1)\gamma - \delta)/\mu\eta. \qquad (28)$$

Note that $p_t < U'(C_t)$ if $s > s^*$, which means there is excessive saving.
Conversely, $p_t > U'(C_t)$ if $s < s^*$, which means there is excessive consumption.
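The comparison between the accounting price and marginal utility can be written out explicitly. The lines below are a worked check that uses only (21), (22) and (27); the arrangement of steps is ours and is offered as a sketch rather than as the original derivation.

```latex
\begin{align*}
U'(C_t) &= (\eta - 1)\,C_t^{-\eta} = (\eta - 1)\,[(1 - s)\mu K_t]^{-\eta}, \\
\frac{p_t}{U'(C_t)} &= \frac{(1 - s)\mu}{(\eta - 1)(s\mu - \gamma) + \delta}, \\
\frac{p_t}{U'(C_t)} = 1
  &\iff (1 - s)\mu = (\eta - 1)(s\mu - \gamma) + \delta
  \iff s = \frac{\mu + (\eta - 1)\gamma - \delta}{\mu\eta}.
\end{align*}
```

Since the numerator of the ratio falls and the denominator rises as the saving ratio increases (for an elasticity of marginal utility above one), the ratio is below one when saving exceeds the optimum and above one when it falls short.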



(ii) CONSUMPTION NUMERAIRE

Write

$$P_t = p_t/U'(C_t). \qquad (29)$$

Using (26) in (29) yields

$$P_t = \int_t^\infty [U'(C_\tau)/U'(C_t)][\partial C_\tau/\partial K_t]e^{-\delta(\tau - t)}\,d\tau. \qquad (30)$$

Now use (21), (22) and (30) to obtain

$$P_t = \int_t^\infty (1 - s)\mu e^{(-\rho + (s\mu - \gamma))(\tau - t)}\,d\tau, \qquad (31)$$

where $\rho = \delta + \eta(s\mu - \gamma)$.

From (31) we have

$$P_t = (1 - s)\mu/(\rho - (s\mu - \gamma)). \qquad (32)$$

Observe that $P_t > 1$ (resp. $< 1$) if $s < s^*$ (resp. $> s^*$).

In order to obtain a sense of orders of magnitude, suppose $\eta = 2$, $\mu = 0.20$, $\gamma =
0.05$, and $\delta = 0$. From (28) we have $s^* = 0.625$. Now imagine that $s = 0.40$ (by
Ramsey's criterion, this is undersaving!). Using (23) we have $\rho = 0.06$ per unit of
time. So (32) reduces to $P_t = 4$. In other words, a saving rate that is 36 percent
short of the optimum (0.40 against 0.625) corresponds to a high figure for the accounting
price of investment: investment should be valued four times consumption.
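The numerical illustration can be reproduced directly from (23), (28) and (32). This is a minimal sketch under the parameter values quoted in the text; the function names are ours and purely illustrative.

```python
def optimum_saving_rate(mu, gamma, delta, eta):
    """Optimum saving ratio s* from equation (28)."""
    return (mu + (eta - 1.0) * gamma - delta) / (mu * eta)


def investment_accounting_price(s, mu, gamma, delta, eta):
    """Accounting price of investment in consumption numeraire, equation (32)."""
    growth = s * mu - gamma            # growth rate of consumption, from (21)
    rho = delta + eta * growth         # consumption discount rate, from (23)
    denominator = rho - growth
    if denominator <= 0.0:
        # The integral in (31) diverges unless rho exceeds the growth rate.
        raise ValueError("integral in (31) does not converge for these parameters")
    return (1.0 - s) * mu / denominator


if __name__ == "__main__":
    s_star = optimum_saving_rate(mu=0.20, gamma=0.05, delta=0.0, eta=2.0)
    price = investment_accounting_price(s=0.40, mu=0.20, gamma=0.05, delta=0.0, eta=2.0)
    print(s_star, price)
```

Evaluating the price at the optimum saving ratio returns one, which is the sense in which a unit of investment and a unit of consumption are equally valuable along the optimum.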

Although intergenerational equity is nearly always discussed in terms of the
rate at which future well-being is discounted (see e.g., Fortney and Bryant 1998),
equity would be more appropriately discussed in terms of the curvature of $U$. Let
the unit of time be a year. Suppose $\gamma = 0$, $\delta = 0.02$, and $\mu = 0.32$. Consider two
alternative values of $\eta$: 25 and 50. It is simple to confirm that $s^* = 0.038$ if $\eta = 25$
and $s^* = 0.019$ if $\eta = 50$. Intergenerational equity in both consumption and welfare
(the latter is a concave function of the former) can be increased indefinitely by
making $\eta$ larger and larger: $C_t$ becomes "flatter" as $\eta$ is increased. In the limit, as $\eta$
goes to infinity, $s^*$ tends to $\gamma/\mu$ (equation (28)), at which consumption is constant;
this reflects the Rawlsian maxi-min consumption as applied to the intergenerational context.

4. Illustration 2: A Non-convex Ecosystem

The Ramsey-Solow economy discussed above is convex. In this section we confirm 
that the theory presented in Section 2 can be applied to non-convex economies. We 
do this by studying a model of shallow lakes.

A key determinant of the overall state of a shallow lake is phosphorus, which is 
a necessary nutrient for such ecological services in the lake as those that provide 
a habitat for fish populations. But at high levels of concentration phosphorus is 
a pollutant, causing as it does increased plant growth, algae blooms, decrease in 
water transparency, bad odour, oxygen depletion, and fish kills. Thus, the state of a 
lake can be taken to be the quantity of phosphorus in the water column, which we 
denote by a scalar, S. 

The rate of phosphorus inflow into a lake is a byproduct of agriculture in the
watershed (e.g., as fertilizer runoff from farms). We bring these considerations
together and postulate that current utility is a strictly concave and twice
differentiable function $U(C, S)$, where $U$ is an increasing function of phosphorus inflow,
$C$. Imagine next that phosphorus has a deleterious effect on the lake at all levels
of concentration (and not just at high levels of concentration); which is to say that
$U$ is a decreasing function of $S$ for all $S$. This assumption brings into sharp relief
those economic problems where a produced good has positive social worth as a
flow, even though it is a pollutant as a stock.

Social welfare at $t$ is

$$V_t = \int_t^\infty U(C_\tau, S_\tau)e^{-\delta(\tau - t)}\,d\tau, \quad \text{where } U_S < 0 \text{ and } U_C > 0.$$

4.1. CONSTANT PHOSPHORUS INFLOW

For simplicity of exposition, we suppose in what follows that

$$U(C, S) = \log C - hS^2, \quad h > 0. \qquad (33)$$

Consider the case where the resource allocation mechanism for phosphorus inflow
is such that $C_t$ is a constant, say $C$. Studies have confirmed that there is a feedback
of phosphorus from bottom sediments when the density of algae in the lake is large.
This feedback is reflected in the form of recycling - from sediment to the water
column. Experiments suggest that the recycling rate, $R$, is a sigmoid function of $S$.
A simple form of the relationship is,

$$R_t = bS_t^2/(1 + S_t^2), \quad \text{where } b > 0. \qquad (34)$$

The rate of input of phosphorus into the water column is therefore
$[C + bS_t^2/(1 + S_t^2)]$.

However, phosphorus is depleted from the water column owing to sedimentation
and water outflow. Assuming that the rate of loss is proportional to $S$, say $\gamma S$
($\gamma > 0$), the phosphorus content in the lake's water column is governed by the
equation,

$$dS_t/dt = C + bS_t^2/(1 + S_t^2) - \gamma S_t. \qquad (35)$$

For a range of parameter values $C$, $b$, and $\gamma$, the curves $[C + bS^2/(1 + S^2)]$ and $\gamma S$
intersect at three points. This is shown in Figure 1. The upper and lower intersects,
$S_3$ and $S_1$, are stable stationary points of (35), whereas the intermediate intersect,
$S_2$, is unstable. Thus, $S_2$ is the unique separatrix of the dynamical system. $S_3$ and
$S_1$ should be thought of as eutrophic and oligotrophic states, respectively. Thus,
given $S_t$, the resource allocation mechanism, $\alpha$, governing the lake's quality can be
expressed as,

$$dS_\tau/d\tau = C + bS_\tau^2/(1 + S_\tau^2) - \gamma S_\tau, \quad \tau \geq t. \qquad (36)$$

Clearly, $\alpha$ is autonomous and time consistent. It is simple to confirm that $V(S)$
is differentiable in $S$ everywhere, excepting $S_2$. It is simple to confirm as well
that, although $V(S)$ is discontinuous at $S_2$, it possesses both right- and left-hand
derivatives there. We can therefore define the accounting price of the lake's quality
to be $p(S) = \partial V/\partial S$ at all $S \neq S_2$ and apply the theory locally for the purposes of
project evaluation and sustainability assessment. It should be noted that because
phosphorus is a pollutant in the lake, $p(S) < 0$.
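The three stationary points of (35) and their stability can be located numerically. The sketch below is illustrative only: the parameter values C = 0.1, b = 1 and gamma = 0.6 are assumptions chosen so that three intersections exist (they are not taken from the text), and the root finder is a plain sign-change scan with bisection.

```python
def lake_rate(S, C, b, gamma):
    """Right-hand side of (35): dS/dt for a constant inflow C."""
    return C + b * S**2 / (1.0 + S**2) - gamma * S


def lake_rate_slope(S, b, gamma):
    """Derivative of the right-hand side of (35) with respect to S."""
    return 2.0 * b * S / (1.0 + S**2) ** 2 - gamma


def stationary_points(C, b, gamma, S_max=3.0, n_cells=700):
    """Roots of (35) on [0, S_max), found by a sign-change scan plus bisection."""
    roots = []
    step = S_max / n_cells
    for i in range(n_cells):
        lo, hi = i * step, (i + 1) * step
        f_lo, f_hi = lake_rate(lo, C, b, gamma), lake_rate(hi, C, b, gamma)
        if f_lo == 0.0:
            roots.append(lo)  # a grid point happens to be an exact root
            continue
        if f_lo * f_hi < 0.0:
            for _ in range(60):  # bisection on the bracketing cell
                mid = 0.5 * (lo + hi)
                f_mid = lake_rate(mid, C, b, gamma)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            roots.append(0.5 * (lo + hi))
    return roots


def classify(C, b, gamma):
    """Pair each stationary point with True if it is locally stable."""
    return [(S, lake_rate_slope(S, b, gamma) < 0.0)
            for S in stationary_points(C, b, gamma)]


if __name__ == "__main__":
    for S, stable in classify(C=0.1, b=1.0, gamma=0.6):
        print(round(S, 6), "stable" if stable else "unstable")
```

For these assumed values the stationary points are 1/3, 1/2 and 1: the outer two are stable (oligotrophic and eutrophic), and the middle one is the unstable separatrix.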
4.2. OPTIMUM PHOSPHORUS INFLOW

The resource allocation mechanism defined by (36) reflects an imperfect economy.
Brock and Starrett (2003) have studied the optimum resource allocation mechanism.
To review their work, we generalize (36). If $C_t$ is the inflow of phosphorus,
the lake's dynamics are given by the equation,

$$dS_t/dt = C_t + bS_t^2/(1 + S_t^2) - \gamma S_t, \quad \text{for } t \geq 0, \qquad (37)$$

where $S_0$ is given as an initial condition.

The problem is to choose $\{C_t\}_0^\infty$ so as to maximize social welfare, with $U$ given
by (33), subject to (37). Clearly, the optimum resource allocation mechanism is both
autonomous and time consistent. In what follows, we restrict ourselves to the case
where the optimum is an interior one (i.e., $C_t > 0$). Let $p_t$ be the accounting price of
phosphorus in the lake. Brock and Starrett confirmed that, for $\{C_t\}_0^\infty$ to be an
optimum, it is necessary that $C_t$ and $S_t$ satisfy not only (37), but also the Pontryagin
conditions,

$$p_t = -U_C\ (< 0), \quad \text{for all } t, \qquad (38)$$

$$\text{and} \quad (dp_t/dt)/p_t = \delta + \gamma - U_S/U_C - 2bS_t/(1 + S_t^2)^2, \quad \text{for all } t. \qquad (39)$$

The point therefore is to select $p_0$ (equivalently, $C_0$) optimally and allow the
dynamical system to evolve in accordance with equations (37)-(39). The authors
showed that, in the $(p, S)$ space, equations (37)-(39) can have at most a countable
number of stationary points. They studied in detail the class of parameter values
for which the number of stationary points is three. They found that two of them
(call them $S_1$ and $S_3$, with $S_1 < S_3$, corresponding to what could be interpreted
to be the oligotrophic and eutrophic state, respectively) are saddle points, while
the intermediate point (call it $S_2$) is a spiral source (i.e., it is unstable). The
authors showed that there exists a value of phosphorus stock, $\bar{S}$, such that if
$S_0 > \bar{S}$, the optimum programme asymptotes to $S_3$; but if $S_0 < \bar{S}$, it asymptotes to $S_1$.
In short, history matters. It is easy to confirm that if, by fluke, $S_0 = \bar{S}$, there are
two equally desirable optimal programmes, one that asymptotes to $S_1$, another that
asymptotes to $S_3$. This last property can be shown to imply that $V(S)$, although not
differentiable at $\bar{S}$, is continuous at $\bar{S}$ and possesses both left- and right-derivatives.
$\bar{S}$ is an endogenously determined separatrix.

Since the optimum resource allocation mechanism is autonomous, we may write
by $p(S)$ the optimum policy function. Phosphorus being a pollutant in the lake, we
have $p(S) < 0$. It can be shown that $V(S)$ is differentiable everywhere excepting
at $\bar{S}$. It can also be demonstrated that $p(S)$ is discontinuous at $\bar{S}$, but is left- and
right-differentiable there. Moreover,

$$p(S) = \partial V/\partial S\ (< 0), \quad \text{for all } S \neq \bar{S}. \qquad (40)$$

Writing by $[p(S)]_{\bar{S}-0}$ (resp., $[p(S)]_{\bar{S}+0}$) the limit of $p(S)$ as $S$ tends to $\bar{S}$ from
the left (resp., right), and similarly for $[\partial V/\partial S]_{\bar{S}-0}$ and $[\partial V/\partial S]_{\bar{S}+0}$, it can be
shown too that $[p(S)]_{\bar{S}-0} = [\partial V/\partial S]_{\bar{S}-0}$ and $[p(S)]_{\bar{S}+0} = [\partial V/\partial S]_{\bar{S}+0}$. The theory
we have outlined in Section 2 is thus applicable to the optimum resource allocation
mechanism of this particular non-convex economy.

Having illustrated the theory by means of three examples, we now proceed to
obtain rules for estimating accounting prices. We do this by focussing on specific 
categories of capital assets and several well known institutional imperfections. 



5. Exhaustible Resources: The Closed Economy

Accounting prices of exhaustible resources when depletion rates are optimal have 
been much studied (e.g., Dasgupta and Heal 1979; see below). What is the structure 
of their accounting prices when resources are instead common pools? 

Two property-rights regimes suggest themselves: open access and restricted 
entry. They in turn need to be compared to an optimum regime. It is simplest if 
we avoid a complete capital model. So we resort to a partial equilibrium world: 
income effects are assumed to be negligible. Let Rt be the quantity extracted at 
t. Income is the numeraire. Let U(R) be the area under the demand curve below 
R. So U'(R) is taken to be the market demand function. U is assumed to be an 
increasing and strictly concave function of R for positive values of R. In order to 
have a notation that is consistent with the one in the foregoing example, we take the
social rate of interest to be an exogenously given constant, $\rho$. Let $S_t$ be the stock.
Then,

$$dS_t/dt = -R_t. \qquad (41)$$

5.1. THE OPTIMUM REGIME

In order to construct a benchmark against which imperfect economies can be
evaluated, we first study an optimizing economy. Assume that extraction is costless
(constant unit extraction cost can be introduced easily). Social welfare at $t$ is,

$$V_t = \int_t^\infty U(R_\tau)e^{-\rho(\tau - t)}\,d\tau. \qquad (42)$$

Let $p_t^*$ denote the accounting price of the resource underground (equivalently, the
Hotelling rent, or the optimum depletion charge per unit extracted). We know that

$$dp_t^*/dt = \rho p_t^*. \qquad (43)$$

This is the Hotelling Rule. Moreover, optimum extraction, $R_t^*$, must satisfy the
condition,

$$U'(R_t^*) = p_t^*. \qquad (44)$$

Assume that

$$U(R) = -R^{-(\eta - 1)}, \quad \text{where } \eta > 1. \qquad (45)$$

Then

$$R_t^* = (\rho/\eta)S_0 e^{-\rho t/\eta}. \qquad (46)$$

We next consider the two imperfect regimes.



5.2. RESTRICTED ENTRY 

For vividness, assume that there are $N$ identical farmers ($i, j = 1, 2, \ldots, N$), drawing
from an unrechargeable aquifer. Extraction is costless. We model the situation in
the following way:

At $t$, farmer $i$ owns a pool of size $S_{it}$. Each pool is separated from every other
pool by a porous barrier. Water percolates from the pool which is larger to the one
which is smaller. Let $\lambda_{ij}$ ($> 0$) be the rate at which water diffuses from pool $i$ to
pool $j$. We assume that $\lambda_{ij} = \lambda_{ji}$. Denote by $R_{it}$ the rate at which $i$ draws from his
pool. There are then $N$ depletion equations:

$$dS_{it}/dt = \sum_{N-i}[\lambda_{ji}(S_{jt} - S_{it})] - R_{it}, \qquad (47)$$

where "$\sum_{N-i}$" denotes summation over all $j$ other than $i$.
The payoff function for farmer $i$ at time $t$ is

$$\int_t^\infty U(R_{i\tau})e^{-\rho(\tau - t)}\,d\tau. \qquad (48)$$

Farmers play non-cooperatively. For tractability, we study an open loop solution:
farmers are assumed to be naive (when computing his own optimum extraction
rates, each takes the others' extraction rates as given).

Let $p_{it}$ be the (spot) personal accounting price of a unit of $i$'s own resource pool.
The present value Hamiltonian for $i$'s optimization problem would then be,

$$H_{it} = U(R_{it})e^{-\rho t} + \Big[\sum_{N-i}\lambda_{ji}(S_{jt} - S_{it}) - R_{it}\Big]p_{it}e^{-\rho t}. \qquad (49)$$

It follows from (49) that $p_{it}$ obeys the equation,

$$dp_{it}/dt = \Big(\rho + \sum_{N-i}\lambda_{ji}\Big)p_{it}. \qquad (50)$$



For notational simplicity, assume that $\lambda_{ij} = \lambda$ for all $i, j$. Then (50) reduces to

$$dp_{it}/dt = (\rho + (N - 1)\lambda)p_{it}. \qquad (51)$$

Write $(\rho + (N - 1)\lambda) = \beta$. We conclude that the rush to extract because of
insecure property rights amounts to each extractor using an implicit discount rate,
$\beta$, which is in excess of the social discount rate $\rho$. Assume now that the
elasticity of demand is a constant, $\eta$ ($> 1$). Using (46) and (51), we conclude that the
extraction rate from the common pool is

$$R_\tau = (\beta/\eta)S_t e^{-\beta(\tau - t)/\eta}, \quad \text{for all } \tau \geq t. \qquad (52)$$

In order to have a meaningful problem, we take it that $\beta/\eta > \beta - \rho$ (see below).

Let $p_t$ be the resource's (social) accounting price. We know that $p_t = \partial V_t/\partial S_t$.
Using (46), it follows that,

$$p_t = \int_t^\infty U'(R_\tau)[\partial R_\tau/\partial S_t]e^{-\rho(\tau - t)}\,d\tau. \qquad (53)$$

Write $P_t = p_t/U'(R_t)$. Then (51) and (53) imply

$$P_t = \beta/(\beta - \eta(\beta - \rho)) > 1. \qquad (54)$$

(Notice that $P_t = 1$ if $\beta = \rho$.)

As a numerical illustration, consider the case where $\rho = 0.06$, $\beta = 0.10$, and $\eta =
2$. In this case, $P_t = 5$, which reflects a considerable imperfection in the resource
allocation mechanism in question: the resource's accounting price is five times its
market price.
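The ratio in (54) is easy to tabulate for other parameter values. The following is a minimal sketch; the function names are illustrative, and the five-farmer example in the usage lines is an assumption made only to show how an implicit rate of 0.10 could arise from (51).

```python
def implicit_discount_rate(rho, lam, n_farmers):
    """Implicit private discount rate beta = rho + (N - 1) * lambda, from (51)."""
    return rho + (n_farmers - 1) * lam


def restricted_entry_price_ratio(rho, beta, eta):
    """Ratio of the social accounting price to the market price, equation (54)."""
    denominator = beta - eta * (beta - rho)
    if denominator <= 0.0:
        # Mirrors the condition beta/eta > beta - rho needed for a meaningful problem.
        raise ValueError("requires beta/eta > beta - rho")
    return beta / denominator


if __name__ == "__main__":
    # Hypothetical case: five farmers and a diffusion rate of 0.01 give beta = 0.10.
    beta = implicit_discount_rate(rho=0.06, lam=0.01, n_farmers=5)
    print(beta, restricted_entry_price_ratio(rho=0.06, beta=beta, eta=2.0))
```

With secure property rights the implicit rate equals the social rate and the ratio collapses to one.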



5.3. OPEN ACCESS 

We next study an open-access pool. To have a meaningful problem, we now assume
that extraction is costly. For simplicity, let the unit extraction cost be a constant
$k$ ($> 0$). Under open access, Hotelling rents are dissipated completely. Therefore,
the equilibrium extraction rate, $R_t$, is the solution of the equation,

$$U'(R_t) = k. \qquad (55)$$

Equation (55) confirms that, for any given level of reserves, there is excessive
extraction. Let $R$ be the solution of (55). We then have,

$$dS_t/dt = -R.$$

Reserves remain positive for a period $T = S_t/R$. Let us normalize utility by setting
$U(0) = 0$. It follows that,

$$V_t = \int_t^{t + S_t/R}(U(R) - kR)e^{-\rho(\tau - t)}\,d\tau. \qquad (56)$$

Let $p_t$ be the accounting price of the unextracted resource. Then,

$$p_t = \partial V_t/\partial S_t = [(U(R) - kR)/R]e^{-\rho S_t/R} > 0. \qquad (57)$$

Write $P_t = p_t/U'(R)$, which is the ratio of the resource's shadow price to its unit
extraction cost. Then, from (55) and (57),

$$P_t = [(U(R) - kR)/kR]e^{-\rho S_t/R} > 0. \qquad (58)$$

(58) resembles a formula proposed by El Serafy (1989) for estimating depletion
charges. The charge is positive because an extra unit of water in the aquifer would
extend the period of extraction. Notice that $P_t$ is bounded above by the ratio of the
Marshallian consumer surplus to total extraction cost; furthermore, it increases as
the aquifer is depleted and attains its upper bound at the date at which the pool is
exhausted. If reserves are large, $P_t$ is small, and free access involves no great loss
- a familiar result.

What are plausible orders of magnitude? Consider the linear demand function.
Assume therefore that

$$U(R) = aR - bR^2, \quad \text{where } a > k \text{ and } b > 0. \qquad (59)$$

From (55) and (59),

$$R = (a - k)/2b. \qquad (60)$$

Substituting (59) and (60) in (58),

$$P_t = ((a - k)/2k)e^{-2b\rho S_t/(a - k)}. \qquad (61)$$

Equation (61) says that

$$P > 1 \quad \text{iff} \quad \rho S < ((a - k)/2b)\ln((a - k)/2k).$$

(61) expresses the magnitude of $P$ in terms of the parameters of the model. Suppose,
for example, that $\rho = 0.02$ per year, $S/R = 100$ years (i.e., at the current rate of
extraction, the aquifer will be exhausted in 100 years), $(a - k)/2k = 20$ (e.g., $k =
\$0.50$ and $(a - k) = \$20$). Then

$$P = 20\exp(-2) \approx 2.7. \qquad (62)$$

We should conclude that the value to be attributed to water at the margin is
high (about 2.7 times extraction cost). As the date of exhaustion gets nearer, the
accounting price rises to its upper bound, 20.
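The open-access ratio (61) can be evaluated for any reserve level. This is a minimal sketch with illustrative function names; the value b = 1 and the reserve of 1000 units are assumptions chosen only so that S/R = 100 years, as in the example.

```python
import math


def open_access_extraction(a, b, k):
    """Equilibrium extraction rate under open access, equation (60)."""
    return (a - k) / (2.0 * b)


def open_access_price_ratio(a, b, k, rho, reserves):
    """Shadow price of the resource relative to unit extraction cost, equation (61)."""
    extraction = open_access_extraction(a, b, k)
    upper_bound = (a - k) / (2.0 * k)          # value reached at exhaustion
    return upper_bound * math.exp(-rho * reserves / extraction)


if __name__ == "__main__":
    # k = 0.50 and a - k = 20, so (a - k)/2k = 20; b = 1 is an assumed slope.
    a, b, k, rho = 20.5, 1.0, 0.5, 0.02
    reserves = 100.0 * open_access_extraction(a, b, k)   # 100 years of extraction
    print(open_access_price_ratio(a, b, k, rho, reserves))
    print(open_access_price_ratio(a, b, k, rho, 0.0))    # upper bound at exhaustion
```

The ratio rises monotonically towards its upper bound as reserves fall, which is the behaviour described in the text.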



6. Exploration and Discoveries 

How should one account for expenditure on explorations of new deposits of 
exhaustible resources? We imagine that the rate at which new reserves are 
discovered (N) is an increasing function of (1) current expenditure on explorations 
(E) and (2) the accumulated expenditure on explorations (M), but is a declining 
function of (3) accumulated extraction ($Z$). Denote the discovery function by
$N(E_t, M_t, Z_t)$, where

$$dM_t/dt = E_t, \qquad (63)$$

$$\text{and} \quad dZ_t/dt = R_t. \qquad (64)$$

We revert to the model containing one manufactured capital good, $K$, and an
exhaustible natural resource, $S$. In the familiar notation, $Y = F(K, R)$ is taken to
be the aggregate production function. The remaining equations of motion are,

$$dK_t/dt = F(K_t, R_t) - C_t - E_t, \qquad (65)$$

$$dS_t/dt = N(E_t, M_t, Z_t) - R_t. \qquad (66)$$

The model has four capital assets $K$, $S$, $M$, and $Z$. Their accounting prices are
denoted by $p_K$, $p_S$, $p_M$, and $p_Z$, respectively. Social welfare is given by (1). From
Theorem 1, we have

$$dV_t/dt = p_K[F(K_t, R_t) - C_t - E_t] + p_S[N(E_t, M_t, Z_t) - R_t] + p_M E_t + p_Z R_t. \qquad (67)$$

There are two cases to consider: 

(A) Assume that $\partial N/\partial M = 0$ (implying that $p_M = 0$) and $\partial N/\partial Z < 0$ (implying
that $p_{Zt} < 0$). Even in this case genuine investment is not the sum of investment in
manufactured capital and changes in proven reserves ($N_t - R_t$). This is because new
reserves are valued differently from existing reserves. Note too that exploration
costs should not be regarded as investment.

Consider now the special case where the mining industry optimizes. Then
$p_K = p_S\,\partial N/\partial E$. If, in addition, $p_S N_t$ can be approximated by $p_K E_t$, one could
exclude discoveries of new reserves from genuine investment, but regard instead
exploration costs as part of that investment.

(B) Suppose $\partial N/\partial M > 0$. If the industry optimizes, we have

$$p_K = p_M + p_S\,\partial N/\partial E, \qquad (68)$$

and so $p_K > p_M$. It follows that genuine investment should now include not only
new discoveries and investment in manufactured capital (as in Case A), but also
exploration costs, using an accounting price that is less than that of manufactured
capital.



7. Forests and Trees 

As stocks, forests offer a multitude of services. Here we focus on forests as a source 
of timber. Hamilton and Clemens (1999) regard the accounting value of forest 
depletion to be the stumpage value (price minus logging costs) of the quantity of 
commercial timber and fuelwood harvested in excess of natural regeneration rates. 
This is an awkward move, since the authors do not say what is intended to happen 
to the land being deforested. For example, if the deforested land is converted into an 
urban sprawl, the new investment in the sprawl would be recorded in conventional 
accounting statistics. But if it is intended to be transformed into farmland, matters 
would be different: the social worth of the land as a farm should be included as an 
addition to the economy’s stock of capital assets. In what follows, we consider the 
simple case where the area is predicted to remain a forest. 

Let the price of timber, in consumption numeraire, be unity and let $\rho$ (assumed
constant) be the social rate of discount. Holding all other assets constant, if $B_t$ is
aggregate forest land at $t$, we may express social welfare as $V(B_t)$. The accounting
price of forest land is then $\partial V_t/\partial B_t$, which we write as $p_t$.

Consider a unit of land capable of supporting a single tree and its possible
successors. If the land is virgin, if a seed is planted at $t = 0$, if $F(T)$ is the timber
yield of a tree aged $T$, and if $T$ is the rotation cycle, then the present discounted
value of the land as a tree-bearer is,

$$p_0 = F(T)e^{-\rho T}/(1 - e^{-\rho T}). \qquad (69)$$

Suppose instead that at $t = 0$ the piece of land in question houses a tree aged $\tau$.
What is the value of the land?

If the cycle is expected to be maintained, we have

$$p_0 = F(T)e^{-\rho(T - \tau)}/(1 - e^{-\rho T}). \qquad (70)$$

If instead the tree is logged now, but the cycle is expected to be maintained, the
value of the land, after the tree has been felled, is given by (69). Depreciation of
the forest, as a capital asset, is the difference between (70) and (69).
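The depreciation rule can be made concrete by evaluating (69) and (70) side by side. This is a minimal sketch; the function names are illustrative, and the yield of 100, the 50-year rotation and the 3 percent discount rate in the usage lines are assumed values, not figures from the text.

```python
import math


def bare_land_value(timber_yield, rotation, rho):
    """Value of land just planted, equation (69); timber_yield stands for F(T)."""
    discount = math.exp(-rho * rotation)
    return timber_yield * discount / (1.0 - discount)


def standing_forest_value(timber_yield, rotation, rho, age):
    """Value of land carrying a tree of the given age, equation (70)."""
    if not 0.0 <= age <= rotation:
        raise ValueError("age must lie between 0 and the rotation length")
    return timber_yield * math.exp(-rho * (rotation - age)) / (1.0 - math.exp(-rho * rotation))


def depreciation_from_felling(timber_yield, rotation, rho, age):
    """Difference between (70) and (69): the loss in asset value when the tree is felled now."""
    return (standing_forest_value(timber_yield, rotation, rho, age)
            - bare_land_value(timber_yield, rotation, rho))


if __name__ == "__main__":
    for age in (0.0, 25.0, 50.0):
        print(age, depreciation_from_felling(100.0, 50.0, 0.03, age))
```

Two limiting cases serve as a check: felling a newly planted seedling involves no depreciation, and felling a tree at the rotation age involves depreciation equal to the full timber yield.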
8. Human Capital

To develop an accounting framework for knowledge acquisition and skill formation,
consider a modified version of the basic model of Section 2. In particular, the
underlying resource allocation mechanism is assumed to be autonomous. Labour
hours are assumed to be supplied inelastically and population is constant; we may
as well then normalize by regarding the labour-hours supplied to be unity.

Production of the consumption good involves physical capital, $K_{1t}$, and human
capital, $H_{1t}$. Here, $H_{1t}$ is to be interpreted to be the human capital embodied in those
who work in the sector producing the consumption good. Thus, if $Y_t$ is output of
the consumption good,

$$Y_t = F(K_{1t}, H_{1t}), \qquad (71)$$

where $F$ is an increasing function of its arguments.

Assume that human capital is produced with the help of physical capital, $K_{2t}$,
and human capital, $H_{2t}$, and that, owing to mortality, it depreciates at a constant
rate, $\gamma$. Output of human capital is given by the technology

$$G(K_{2t}, H_{2t}), \qquad (72)$$

where $G$ is an increasing function of its arguments and strictly concave,
representing that the input of students is given.

By assumption, all individuals at a given moment of time have the same amount
of human capital. Therefore, $H_{1t}/(H_{1t} + H_{2t})$ is the proportion of people employed
in the sector producing the consumption good. Let the total stock of human capital
be $H$. It follows that

$$H_{1t} + H_{2t} = H_t. \qquad (73)$$

Write

$$K_{1t} + K_{2t} = K_t. \qquad (74)$$

For simplicity of exposition, we assume that physical capital does not depreciate.
Accumulation of physical capital can be expressed as

$$dK_t/dt = F(K_{1t}, H_{1t}) - C_t, \qquad (75)$$

and the accumulation of human capital as

$$dH_t/dt = G(K_{2t}, H_{2t}) - \gamma H_t. \qquad (76)$$

Since the resource allocation mechanism, $\alpha$, is assumed to be autonomous, we have

$$V_t = V(\alpha, K_{1t}, K_{2t}, H_{1t}, H_{2t}). \qquad (77)$$

Let $p_{1t}$ and $p_{2t}$ be the accounting prices of physical capital and $q_{1t}$ and $q_{2t}$ the
accounting prices of human capital, in the two sectors, respectively (i.e., $p_{1t} =
\partial V_t/\partial K_{1t}$, $q_{2t} = \partial V_t/\partial H_{2t}$, and so forth). Therefore, wealth can be expressed as,

$$Z_t = p_{1t}K_{1t} + p_{2t}K_{2t} + q_{1t}H_{1t} + q_{2t}H_{2t},$$

and genuine investment by

$$I_t = p_{1t}\,dK_{1t}/dt + p_{2t}\,dK_{2t}/dt + q_{1t}\,dH_{1t}/dt + q_{2t}\,dH_{2t}/dt. \qquad (78)$$

Estimating $q_{1t}$ and $q_{2t}$ poses difficult problems in practice. It has been customary
to identify human capital with education and to estimate its accounting price in
terms of the market return on education (i.e., salaries over and above raw labour).
But this supposes, as we have assumed in the above model, that education offers no
direct utility. If education does offer direct utility (and it is widely acknowledged
to do so), the market return on education is an underestimate of what we should
ideally be after. Furthermore, human capital includes health, which too is both a
durable consumption good and capital good.

An alternative is to use estimates of expenditures on health and education for
the purpose in hand. Such a procedure may be a reasonable approximation for
poor societies, but it is in all probability far off the mark for rich societies.

If $\alpha$ were an optimum resource allocation mechanism, we would have $p_{1t} =
p_{2t} = p_t$, say, and $q_{1t} = q_{2t} = q_t$, say. These prices would be related by the optimality
conditions

$$U'(C_t) = p_t; \quad p_t\,\partial F/\partial K_1 = q_t\,\partial G/\partial K_2;$$

$$\text{and} \quad p_t\,\partial F/\partial H_1 = q_t\,\partial G/\partial H_2.$$



9. Global Public Goods

Countries interact with one another not only through trade in international markets, 
but also via transnational externalities. Hamilton and Clemens (1999) include 
carbon dioxide in the atmosphere in their list of assets and regard the accounting 
price (a negative number) of a country’s emission to be the amount it would be 
required to pay the rest of the world if carbon emissions were the outcome of a 
fully cooperative agreement. Their procedure is, consequently, valid only if each 
country is engaged in maximising global welfare, an unusual scenario. In what 
follows, we develop the required analysis. 

Let $G_t$ be the stock of a global common at $t$. We imagine that $G$ is measured in
terms of a "quality" index which, to fix ideas, we shall regard as carbon dioxide
concentration in the atmosphere. Being a global common, $G$ is an argument in the
value function $V$ of every country. For simplicity of notation, we assume that there
is a single private capital good. Let $K_{jt}$ be the stock of the private asset owned by
citizens of country $j$ and let $\alpha_j$ be $j$'s (autonomous) resource allocation mechanism
and $\alpha$ the vector of resource allocation mechanisms. If $V_j$ is $j$'s value function, we
have

$$V_{jt} = V_j(\alpha, K_{jt}, G_t). \qquad (79)$$

Let $p_{jt} = \partial V_{jt}/\partial K_{jt}$ and $g_{jt} = \partial V_{jt}/\partial G_t$. It may be that $G$ is an economic "good" for
some countries, while it is an economic "bad" for others. For the former, $g_j > 0$;
for the latter, $g_j < 0$. Let $E_{kt}$ be the emission rate from country $k$ and let $\gamma$ be the
rate at which carbon in the atmosphere is sequestered. It follows that

$$dG_t/dt = \sum_k E_{kt} - \gamma G_t. \qquad (80)$$

Genuine investment in $j$ is,

$$I_t = dV_{jt}/dt = p_{jt}\,dK_{jt}/dt + g_{jt}\,dG_t/dt,$$

which, on using (80), can be expressed as

$$I_t = p_{jt}\,dK_{jt}/dt + g_{jt}\Big(\sum_k E_{kt} - \gamma G_t\Big). \qquad (81)$$

Notice that the expression on the RHS of (81) is the same whether or not $\alpha$ is based
on international cooperation. On the other hand, $dK_{jt}/dt$ and $dG_t/dt$ do depend on
how the international resource allocation mechanisms are arrived at (e.g., whether
they are cooperative or non-cooperative); and they affect the accounting prices, $p_{jt}$
and $g_{jt}$.

10. Exogenous Productivity Growth 

To assume exogenous growth in total factor productivity (the residual) over the 
indefinite future is imprudent. It is hard to believe that serendipity, unbacked by 
R&D effort and investment, can be a continual source of productivity growth. 
Moreover, many environmental resources go unrecorded in growth accounting. If 
the use of natural capital in an economy has in fact been increasing, estimates of 
the residual could be presumed to be biased upward. On the other hand, if a poor 
country were able to make free use of the R&D successes of rich countries, it would 
enjoy a positive residual. 

The residual can have short bursts in imperfect economies. Imagine that a 
government reduces economic inefficiencies by improving the enforcement of 
property rights, or reducing centralized regulations (import quotas, price controls, 
and so forth). We would expect the factors of production to find better uses.
As factors realign in a more productive fashion, total factor productivity would
increase.

In the opposite vein, the residual could become negative for a period. Increased
government corruption could be a cause; the cause could also be civil strife, which
destroys capital assets and damages a country's institutions. When institutions
deteriorate, assets are used even more inefficiently than before and the residual
declines. This would appear to have happened in sub-Saharan Africa during the
past forty years (Collins and Bosworth 1996).

We now study sustainability in the context of two models of exogenous
productivity growth.

10.1. LABOUR-AUGMENTING TECHNICAL PROGRESS

Consider an adaptation of the model explored in Section 3. Physical capital and a
constant labour force together produce a non-deteriorating all purpose commodity.
The economy enjoys labour augmenting technological progress at a constant rate
$n$. If $K$ is capital and $A$ is knowledge, we have in the usual notation,

$$Y_t = F(K_t, A_t), \qquad (82)$$

$$dK_t/dt = F(K_t, A_t) - C_t, \qquad (83)$$

$$dA_t/dt = nA_t. \qquad (84)$$

There are two capital goods, $K$ and $A$. Let $p_K$ and $p_A$, respectively, be
their accounting prices in utility numeraire. The sustainability criterion is then
$p_K\,dK_t/dt + p_A\,dA_t/dt \geq 0$, or, equivalently,

$$dK_t/dt + q_t\,dA_t/dt \geq 0, \quad \text{where } q_t \equiv p_A/p_K. \qquad (85)$$

It is instructive to study the case where the resource allocation mechanism is
optimal. The equations of motion for $p_K$ and $p_A$ are,

$$dp_K/dt = \delta p_K - p_K\,\partial F/\partial K, \qquad (86)$$

$$\text{and} \quad dp_A/dt = \delta p_A - p_K\,\partial F/\partial A - np_A. \qquad (87)$$

Using (85)-(87) yields,

$$dq_t/dt = (\partial F/\partial K - n)q_t - \partial F/\partial A. \qquad (88)$$

Suppose $F$ displays constant returns to scale. Define $k = K/A$ and $c = C/A$. Write
$f(k) \equiv F(k, 1)$. From (83) and (84) we have

$$dk_t/dt = f(k_t) - nk_t - c_t, \qquad (89)$$

$$\text{or} \quad dk_t/dt = (\partial F/\partial K)k_t + \partial F/\partial A - nk_t - c_t.$$

Adding (88) and (89) yields

$$d(q_t + k_t)/dt = (\partial F/\partial K - n)(q_t + k_t) - c_t. \qquad (90)$$

It is simple to confirm that $q + k$ is the present value of future consumption
(discounted at the rate $\partial F/\partial K$) divided by $A$ (the current state of knowledge). It
follows that the sustainability criterion at $t$ (condition (85)), divided by $A_t$, is

$$dk_t/dt + n(k_t + q_t) \geq 0. \qquad (91)$$



10.2. RESOURCE AUGMENTING TECHNICAL PROGRESS 

Consider an alternative world, where output, $Y$, is a function of manufactured
capital ($K$) and the flow of an exhaustible natural resource ($R$). Let $A_tR_t$ be the
effective supply of the resource in production at $t$ and $S_t$ the resource stock at $t$.
Then we may write,

$$Y_t = F(K_t, A_tR_t), \qquad (92)$$

$$dK_t/dt = F(K_t, A_tR_t) - C_t, \qquad (93)$$

$$dA_t/dt = nA_t, \qquad (94)$$

$$dS_t/dt = -R_t. \qquad (95)$$

There are three state variables. But we can reduce the model to one with two state
variables. Thus, write $Q_t \equiv A_tR_t$ and $X_t = A_tS_t$. Then (93)-(95) become,

$$dK_t/dt = F(K_t, Q_t) - C_t, \qquad (96)$$

$$\text{and} \quad dX_t/dt = nX_t - Q_t. \qquad (97)$$

This is equivalent to a renewable resource problem, and the steady state is the
Green Golden Rule, with

$$nX = Q. \qquad (98)$$

Let $p_K$ and $p_X$ be the accounting prices of $K_t$ and $X_t$, respectively. Then the
sustainability condition is,

$$p_K\,dK_t/dt + p_X\,dX_t/dt \geq 0. \qquad (99)$$

It is instructive to study the case where the resource allocation mechanism
is optimal. Suppose also that $F$ displays constant returns to scale. Following the
approach of the previous example, let $q_t = p_X/p_K$. Then it is easy to confirm that

$$(dq_t/dt)/q_t = \partial F/\partial K - n. \qquad (100)$$

Moreover, the optimal use of the productivity adjusted natural resource, $Q_t$, is
determined by the condition,

$$\partial F/\partial Q = q_t. \qquad (101)$$

Along the optimal programme, the sustainability condition (99) is,

$$F(K_t, Q_t) - C_t + q_t(nX_t - Q_t) \geq 0, \qquad (102)$$

$$\text{or} \quad (\partial F/\partial K)K_t + (\partial F/\partial Q)Q_t - C_t + q_t(nX_t - Q_t) \geq 0, \qquad (103)$$

$$\text{or} \quad (\partial F/\partial K)K_t - C_t + nq_tX_t \geq 0. \qquad (104)$$

Inequality (104) says that consumption must not exceed the sum of capital income 
and the sustainable yield. 
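The passage from (102) to (104) rests on two facts: constant returns to scale and the optimality condition (101). Written out as a short check (the arrangement of steps is ours and is meant only as a sketch of what the text leaves implicit):

```latex
\begin{align*}
F(K_t, Q_t) &= (\partial F/\partial K)K_t + (\partial F/\partial Q)Q_t
  && \text{(Euler's theorem under constant returns)} \\
(\partial F/\partial Q)Q_t - q_t Q_t &= 0
  && \text{(by (101))} \\
\Rightarrow\quad F(K_t, Q_t) - C_t + q_t(nX_t - Q_t)
  &= (\partial F/\partial K)K_t - C_t + n q_t X_t .
\end{align*}
```

The first term on the right is capital income and the last is the sustainable yield of the productivity-adjusted resource valued at its relative accounting price, which gives the verbal reading of (104).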
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11, Exhaustible Resources: The Exporting Economy 

The export of natural resources at given world prices raises issues similar to those
we have just encountered in our analysis of exogenous productivity change. The
exogenous “drift” term, ∂Vt/∂t, in equation (8) has to be estimated.

Assume that extraction is costless. Suppose that at time τ the world market price
of an exhaustible resource is qτ. If Rτ is the volume of export, revenue is qτRτ.

Write Cτ = qτRτ. (105)

The country’s export policy, being governed by the underlying α, can be expressed
as R(τ, St, t) for τ ≥ t. From equation (105) it follows that

dCτ/dt = qτdRτ/dt = (∂Cτ/∂St)dSt/dt + qτ∂Rτ/∂t. (106)

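As before, we assume that social welfare at t is,

$$V_t = \int_t^\infty U(C_\tau)\,e^{-\delta(\tau - t)}\,d\tau. \tag{107}$$

Let pt denote the resource’s accounting price, pt = ∂Vt/∂St. Since the criterion for
sustainable well-being is dVt/dt, we differentiate both sides of equation (107) with
respect to t to obtain,

$$\frac{dV_t}{dt} = \delta V_t - U(C_t) + \int_t^\infty U'(C_\tau)\left[\frac{\partial C_\tau}{\partial S_t}\frac{dS_t}{dt} + q_\tau\frac{\partial R_\tau}{\partial t}\right]e^{-\delta(\tau - t)}\,d\tau. \tag{108}$$

But

dSt/dt = -Rt.

Therefore, equation (108) reduces to

$$\frac{dV_t}{dt} = \delta V_t - U(C_t) - p_t R_t + \int_t^\infty U'(C_\tau)\,e^{-\delta(\tau - t)}\,\frac{\partial C_\tau}{\partial t}\,d\tau. \tag{109}$$

Define μ(τ, t) = ∂Cτ/∂τ + ∂Cτ/∂t. (110)

μ(τ, t) can be regarded as an index of the extent to which the resource allocation
mechanism is non-autonomous. Using equations (105)-(107) and (110), equation
(109) can be reexpressed as,

$$\frac{dV_t}{dt} = \delta V_t - U(C_t) - p_t R_t + \int_t^\infty U'(C_\tau)\,e^{-\delta(\tau - t)}\,\mu(\tau, t)\,d\tau - \int_t^\infty U'(C_\tau)\,e^{-\delta(\tau - t)}\,\frac{\partial C_\tau}{\partial \tau}\,d\tau. \tag{111}$$

On partially integrating the last term on the RHS of equation (111) and cancelling
terms, we obtain,

$$\frac{dV_t}{dt} = p_t\frac{dS_t}{dt} + \int_t^\infty U'(C_\tau)\,e^{-\delta(\tau - t)}\,\mu(\tau, t)\,d\tau. \tag{112}$$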
The integral on the RHS of (112) is the “drift” term. As (112) shows, the index of
sustainable welfare is the algebraic sum of genuine investment and the drift term.
We now proceed to obtain simple rules for estimating the index in the case of two
special non-optimum resource allocation mechanisms.

Suppose C is constant.^^ In this case,

∂Cτ/∂τ = ∂Cτ/∂t = 0,

and so μ(τ, t) in (112) is zero, and genuine investment measures changes in social
welfare.

Suppose instead R is constant. It follows that

∂Rτ/∂τ + ∂Rτ/∂t = 0, (113)

and μ(τ, t) = Rτdqτ/dτ = qτRτ(dqτ/dτ)/qτ. (114)

Using (113) and (114), we may write,

$$\int_t^\infty U'(C_\tau)\,e^{-\delta(\tau - t)}\,\mu(\tau, t)\,d\tau = \mu_t/\delta, \tag{115}$$

where μt can be interpreted as the average capital gains on the world market, as
viewed from time t. Formally, (112) can be re-written as,

dVt/dt = ptdSt/dt + μt/δ. (116)



12. Defensive Expenditure 

How should defensive expenditure toward pollution control appear in national 
accounts? Denote by Qt the stock of defensive capital and Xt investment in its 
accumulation. Let Pt be the stock of pollutants and Yt aggregate output. We may 
then write. 



dPt/dt = G(Yt, Qt) - πPt, (117)

where G(Yt, Qt) > 0, ∂G/∂Y > 0 and ∂G/∂Q < 0.

Moreover, if defensive capital depreciates at the rate γ (> 0), then

dQt/dt = Xt - γQt. (118)

In the usual notation, the accumulation equation is expressed as, 

dKt/dt = F(Kt) - Ct - Xt. (119) 

Denote by pt the accounting price of K, mt that of defensive capital, and rt (< 0)
the accounting price of the pollutant. Wealth can then be expressed as,

ptKt + mtQt + rtPt,

and genuine investment at t as,

It = ptdKt/dt + mtdQt/dt + rtdPt/dt. (120)

Equation (120) says that defensive expenditure against pollution ought to be
included in the estimation of genuine investment (mtdQt/dt), but, then, so should
changes in the quality of the environment be included (rtdPt/dt). To include the
former, but not the latter, would be a mistake.
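
A small numerical sketch may help fix ideas about (120). It compares genuine investment with a partial measure that counts defensive investment but omits the change in the pollution stock. The prices, the stock changes and the function names are hypothetical illustrative values, not estimates from the text.

```python
# Sketch of equation (120): genuine investment with defensive capital.
# All prices and rates of change below are hypothetical illustrative numbers.

def genuine_investment(p, dK, m, dQ, r, dP):
    """I = p*dK/dt + m*dQ/dt + r*dP/dt, where r < 0 prices the pollutant."""
    return p * dK + m * dQ + r * dP

def partial_measure(p, dK, m, dQ):
    """Counts defensive investment but leaves out the change in pollution."""
    return p * dK + m * dQ

if __name__ == "__main__":
    # The pollution stock is still rising (dP > 0) despite defensive investment.
    full = genuine_investment(p=1.0, dK=2.0, m=0.8, dQ=1.0, r=-0.5, dP=4.0)
    partial = partial_measure(p=1.0, dK=2.0, m=0.8, dQ=1.0)
    print(full, partial)
```

Whenever the pollution stock is growing, the partial measure overstates genuine investment by the amount -r*dP/dt; when the stock is falling it understates it.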



13. Population Change and Sustainable Development 

How does demographic change affect the index of sustainable development? There
are a number of conceptual problems inherent in the welfare economics of
reproductive behaviour that still remain unsettled. Such problems have typically been
bypassed in growth accounting; instead, it has been customary there to regard
changes in population as exogenously given. We follow that practice here.^38

We seek to determine how population change influences the drift term (∂Vt/∂t)
on the RHS of equation (8). An equivalent way of casting the problem is to regard
population as a capital asset. Once we do that, what could appear to be a
non-autonomous model reduces to an autonomous one. To illustrate, we adopt a natural
extension of Harsanyi (1955) by regarding social welfare to be the average utility
of all who are ever born. We formalize this “dynamic average utilitarianism” as
follows:

Let Nt be population size at t and n(Nt) the percentage rate of change of Nt.^39
For notational simplicity, we ignore intragenerational inequality and changes in
the age composition of the population. Let ct denote per capita consumption at t. If
Ct is aggregate consumption, ct = Ct/Nt. Assume as before that labour is supplied
inelastically in each period. Current utility of the representative person is U(ct) and
social welfare is,

$$V_t = \frac{\int_t^\infty N_\tau U(c_\tau)\,e^{-\delta(\tau - t)}\,d\tau}{\int_t^\infty N_\tau\,e^{-\delta(\tau - t)}\,d\tau}. \tag{121}$$

If Vt is to be well-defined, we need to suppose that there exists ε > 0, such that
$(\delta - \varepsilon)t > \int_0^t n(N_\tau)\,d\tau$ for large enough t. Notice though that, once we are given
the population forecast, the denominator in (121) is independent of the policies that
could be chosen at t. This means that a policy deemed to be optimal if (121) were
used as the criterion of choice would also be judged to be optimal if instead social
welfare Vt were taken to be of the form,

$$V_t = \int_t^\infty N_\tau U(c_\tau)\,e^{-\delta(\tau - t)}\,d\tau. \tag{122}$$

But for assessing whether or not a pattern of development sustains Vt, it matters
whether Vt is taken to be (121) or (122).
Let Kit denote the stock of the ith type of capital good and write kit = Kit/Nt.
We now express by kt the vector of capital stocks per head. The state variables are
therefore kt and Nt. We take it that α is autonomous. Then equation (121) implies
that

Vt = V(kt, Nt). (123)

Let the numeraire be utility. Define vt = ∂Vt/∂Nt. It is the contribution of an
additional person at t to social well-being. vt is the accounting price of a person
(as distinct from the accounting price of a person’s human capital). Note that vt
can be negative, depending on initial conditions at t and on the resource allocation
mechanism.

Let pit denote the accounting price of kit. Equation (123) then implies

dVt/dt = Σi pitdkit/dt + vtdNt/dt. (124)



The RHS of equation (124) is genuine investment, inclusive of the change in the 
size of the population. It generalizes equation (8). We conclude that Proposition 1 
remains valid so long as wealth comparisons mean comparisons of wealth per 
capita, adjusted for demographic changes. 

In Arrow, Dasgupta, and Mäler (2003), we have studied optimal economies in
which the adjustment term (vtdNt/dt) is not negligible, but nevertheless can be
estimated in a simple way. Dasgupta (2001b) identified a set of circumstances where
the term vanishes even in an imperfect economy. Suppose (i) n(Nt) is independent
of Nt; (ii) all the production processes are linear; and (iii) ct = c(kt), meaning
that under the resource allocation mechanism α, per capita consumption is not
a function of population size. In such circumstances Vt is independent of Nt (i.e.,
vt = 0) and, so, equation (124) reduces to

dVt/dt = Σi pitdkit/dt. (125)



This finding can be summarised as 

Theorem 6 

If (i) n(Nt) is independent of Nt, (ii) all the production processes are linear, and
(iii) ct = c(kt), then social welfare is sustained at a point in time if and only if the
value of the changes in per capita capital assets at that instant is non-negative.

The conditions underlying Theorem 6 are overly strong. It is tempting nevertheless
to regard the value of changes in the per capita stocks of capital assets as
a first approximation of dVt/dt and then to estimate correction terms that reflect
departures from the conditions underlying the theorem. That investigation is left
for future work.

14. Uncertain Productivity

How does future uncertainty in the productivity of capital assets influence
accounting prices? In order to study this question in the simplest possible way, we
revert to the Ramsey-Solow model of Section 3 and assume that the productivity
of the single asset is uncertain. Analytically it is easiest to imagine that the
underlying stochastic process generates a return on investment that is independently and
identically distributed (iid) in each period. For convenience we now suppose that
time is discrete (t = 0, 1, 2, ...). In what follows we indicate that a variable is
random by placing a tilde over it. Let us denote the uncertain productivity of
investment at date t by $\tilde{\mu}_t$. We assume that $\tilde{\mu}_t$ is non-negative and that the
distribution of $\tilde{\mu}_t$ is atomless.

Population is assumed to be a constant and aggregate saving is taken to be
a constant proportion, s, of wealth, where 0 < s < 1. At each t the size of the
capital stock that has been inherited from the previous period is a known quantity.
Consumption is a fixed proportion (1 - s) of that inherited stock. Therefore,
assuming that capital does not deteriorate, the discrete time, stochastic counterpart
of the accumulation equation (19) is,

$$K_{t+1} = (K_t - C_t)\,\tilde{\mu}_t,$$

from which we conclude that

$$K_{t+1} = s\,\tilde{\mu}_t K_t, \quad t \geq 0,$$

and thus,

$$C_\tau = (1 - s)K_t\left[\prod_{k=t}^{\tau-1}(s\,\tilde{\mu}_k)\right], \quad \tau \geq t \geq 0. \tag{126}$$

Denoting by U(C) the utility of consumption, we take it that social welfare (V) is the
expected value of the sum of discounted utilities over time. Letting E denote the
expectation operator, this means that

$$V_t = E\left[\sum_{\tau=t}^{\infty} U(C_\tau)\,\beta^{(\tau - t)}\right], \quad \text{where } \beta = 1/(1 + \delta) \text{ and } \delta > 0. \tag{127}$$

Suppose utility is iso-elastic. Let η be the elasticity of marginal utility. We
consider the empirically interesting case, η > 1. We write U as:

$$U(C) = C^{(1-\eta)}/(1 - \eta), \quad \text{where } \eta > 1. \tag{128}$$

In (128), U is bounded above, but is unbounded below.

Write $E(\tilde{\mu}_t^{(1-\eta)}) = E(\tilde{\mu}^{(1-\eta)})$. If Vt is to be well-defined, we must now suppose
that

$$\beta s^{(1-\eta)}E(\tilde{\mu}^{(1-\eta)}) < 1. \tag{129}$$

Using (126) and (128), and noting that the series in (127) is absolutely convergent,
we can rewrite (127) as

$$V_t = -\frac{(1 - s)^{(1-\eta)}K_t^{(1-\eta)}}{(\eta - 1)\left[1 - \beta s^{(1-\eta)}E(\tilde{\mu}^{(1-\eta)})\right]},$$

and, so, deduce that the asset’s accounting price is

$$p_t = \frac{\partial V_t}{\partial K_t} = \frac{(1 - s)^{(1-\eta)}K_t^{-\eta}}{1 - \beta s^{(1-\eta)}E(\tilde{\mu}^{(1-\eta)})}. \tag{130}$$

How would changes in the distribution of $\tilde{\mu}_\tau$ ($\tau \geq t$) affect pt? To study this,
imagine that $\log(\tilde{\mu}_t)$ is normally distributed with mean m and variance $\sigma^2$. Denote
the mean of $\tilde{\mu}_t$ by $\bar{\mu}$. In that case, we know that

$$\bar{\mu} = \exp(m + \sigma^2/2), \tag{131}$$

$$E(\tilde{\mu}^{(1-\eta)}) = \bar{\mu}^{(1-\eta)}\exp(-\eta(1 - \eta)\sigma^2/2), \tag{132}$$

$$\text{and} \quad \mathrm{var}(\tilde{\mu}) = \bar{\mu}^2[\exp(\sigma^2) - 1]. \tag{133}$$

From (130)-(133) we confirm that, holding $\mathrm{var}(\tilde{\mu})$ constant, $dp_t/d\bar{\mu} < 0$. To
study the effect of an increase in $\mathrm{var}(\tilde{\mu})$ on pt, while keeping $\bar{\mu}$ constant, we must
allow $\sigma$ to increase in such a way that $(m + \sigma^2/2)$ remains unchanged. It is now a
simple matter to confirm that $\partial p_t/\partial(\sigma^2) > 0$. And so, we have

Theorem 7 

Other things the same, (i) if the expected return on investment were to increase,
the asset’s accounting price would decrease, and (ii) if the underlying risk in the
asset’s productivity were to increase, so would its accounting price.
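
Theorem 7 can be illustrated numerically. The sketch below evaluates the accounting price in (130) for a lognormal return, using the standard lognormal moment E(mu^(1-eta)) = mean^(1-eta) * exp(-eta(1-eta)sigma^2/2), and compares a baseline with a riskier case (same mean) and a higher-mean case (same variance). The parameter values and function names are illustrative assumptions.

```python
import math

# Illustrative evaluation of (129)-(133) for a lognormal return.
# Function names and all parameter values are assumptions for this sketch.

def expected_mu_power(mean_mu, sigma2, eta):
    """E(mu^(1-eta)) when log(mu) is normal with variance sigma2 and E(mu) = mean_mu."""
    return mean_mu**(1.0 - eta) * math.exp(-eta * (1.0 - eta) * sigma2 / 2.0)

def variance_of_mu(mean_mu, sigma2):
    """Equation (133): var(mu) = mean^2 [exp(sigma2) - 1]."""
    return mean_mu**2 * (math.exp(sigma2) - 1.0)

def sigma2_for_variance(mean_mu, variance):
    """Invert (133): the log-variance that yields the given var(mu)."""
    return math.log(1.0 + variance / mean_mu**2)

def accounting_price(K, s, beta, eta, mean_mu, sigma2):
    """Equation (130); raises if the convergence condition (129) fails."""
    x = beta * s**(1.0 - eta) * expected_mu_power(mean_mu, sigma2, eta)
    if x >= 1.0:
        raise ValueError("condition (129) violated: V_t is not well-defined")
    return (1.0 - s)**(1.0 - eta) * K**(-eta) / (1.0 - x)

if __name__ == "__main__":
    base = dict(K=1.0, s=0.5, beta=0.95, eta=2.0)
    p_base = accounting_price(mean_mu=2.5, sigma2=0.04, **base)
    # Part (ii): more risk, same mean.
    p_riskier = accounting_price(mean_mu=2.5, sigma2=0.10, **base)
    # Part (i): higher mean, same variance of mu.
    s2 = sigma2_for_variance(3.0, variance_of_mu(2.5, 0.04))
    p_higher_mean = accounting_price(mean_mu=3.0, sigma2=s2, **base)
    print(p_higher_mean, p_base, p_riskier)
```

With eta > 1 the three prices come out in ascending order, as the theorem states; the check in `accounting_price` guards against parameter choices that violate (129).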

Part (i) of Theorem 7 says that an increase in the expected rate of return on
investment would lead to a decrease in the asset’s accounting price, other things the same.
But Part (ii) is also consistent with intuition. From (128) we know that utility, while
bounded above, is unbounded below. We would then expect Vt to be particularly
sensitive to the downside risk in $\tilde{\mu}$. Part (ii) of Theorem 7 says that if the risk in
$\tilde{\mu}$ were to increase, the asset (at the margin) would become more valuable - other
things the same. The Theorem’s message should be expected to be even stronger
if the underlying transformation possibilities among goods and services were to
display thresholds, or, more generally, ecological non-convexities of the kind
present in the model of the shallow lake (Section 4).^42

Of course, consumers could be expected to respond to an increase in the mean 
return on investment, or to an increase in uncertainty in the return. What would 
be their response? We cannot tell unless we model the economic environment in 
which various parties make their saving decisions. The simplest place to look is 
an environment where the saving rate is optimal. There, people’s response to a 
change in risk is also optimal. Levhari and Srinivasan (1969) have shown that in
the model economy being studied here, if U is homogeneous of degree (1 - η) in
C, the optimal saving ratio (s*) is the solution of the equation,

$$(s^*)^{\eta} = \beta E(\tilde{\mu}^{(1-\eta)}). \tag{134}$$
Let us continue to assume that η > 1. From (130) and (134) we conclude that if the
saving rate is optimal, then, other things the same, an increase in the expected return
on investment leads to a decline in the accounting price of capital (i.e., $dp_t/d\bar{\mu} < 0$),
and an increase in the riskiness of return leads to an increase in the accounting
price (i.e., $dp_t/d\sigma^2 > 0$).^43
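
The optimal-saving case can be sketched in the same way. The code below solves (134) for s* under a lognormal return and evaluates (130) at s = s*. Substituting (134) into the denominator of (130) gives beta * s*^(1-eta) * E(mu^(1-eta)) = s*, so the price should equal (1 - s*)^(-eta) * K^(-eta); the usage example checks this numerically rather than assuming it. Parameter values and function names are illustrative assumptions, and here the mean is raised holding sigma^2 (rather than the variance) fixed.

```python
import math

# Illustrative sketch of (130) combined with (134) for a lognormal return.
# Function names and all parameter values are assumptions for this sketch.

def expected_mu_power(mean_mu, sigma2, eta):
    """E(mu^(1-eta)) for lognormal mu with mean mean_mu and log-variance sigma2."""
    return mean_mu**(1.0 - eta) * math.exp(-eta * (1.0 - eta) * sigma2 / 2.0)

def optimal_saving_ratio(beta, eta, mean_mu, sigma2):
    """Solve (134), (s*)^eta = beta * E(mu^(1-eta)), for s*."""
    rhs = beta * expected_mu_power(mean_mu, sigma2, eta)
    if not 0.0 < rhs < 1.0:
        raise ValueError("no saving ratio in (0, 1) solves (134)")
    return rhs**(1.0 / eta)

def price_at_optimum(K, beta, eta, mean_mu, sigma2):
    """Equation (130) evaluated at the optimal saving ratio s*."""
    s = optimal_saving_ratio(beta, eta, mean_mu, sigma2)
    x = beta * s**(1.0 - eta) * expected_mu_power(mean_mu, sigma2, eta)
    return (1.0 - s)**(1.0 - eta) * K**(-eta) / (1.0 - x)

if __name__ == "__main__":
    beta, eta, K = 0.95, 2.0, 1.0
    s_star = optimal_saving_ratio(beta, eta, mean_mu=2.5, sigma2=0.04)
    p_base = price_at_optimum(K, beta, eta, mean_mu=2.5, sigma2=0.04)
    # Closed form implied by (130) and (134): (1 - s*)^(-eta) K^(-eta).
    print(p_base, (1.0 - s_star)**(-eta) * K**(-eta))
    # Comparative statics with eta > 1.
    print(price_at_optimum(K, beta, eta, mean_mu=3.0, sigma2=0.04) < p_base)
    print(price_at_optimum(K, beta, eta, mean_mu=2.5, sigma2=0.10) > p_base)
```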

Accounting prices of capital assets (as opposed to their market prices) are 
rarely estimated; but when they are, the estimates are mostly made on the basis of 
economic models that eschew uncertainty. The general moral of our finding here is 
that such studies underestimate the social worth of those assets. 



15. Concluding Remarks 

In this paper we have explored the way welfare analysis can be conducted in
imperfect economies. In Sections 2-3 it was confirmed that the same set of accounting
prices should be used both for the evaluation of policy reforms (e.g., project
evaluation) and for assessing whether the economic programme being pursued sustains
intergenerational welfare. In Sections 5-14 we studied the properties of accounting
prices of environmental natural resources under a variety of institutional
arrangements. We showed that for a number of cases it is possible to derive simple
formulae for accounting prices. It was found that under plausible values of the
relevant parameters, accounting prices of goods and services can be substantially
different from their market prices.

A large empirical literature in ecology and epidemiology offers evidence that
ecological processes are driven by non-convex transformation possibilities.^44 We
note here in passing that metabolic processes also involve non-convex functional
relationships between nutrition intake and nutritional status.^45 It was confirmed
that accounting prices can be used in non-convex environments (Section 4). Our
hope is that the methods developed here will be of use not only in environmental
and resource economics (our focus of concern here), but also in nutrition and
epidemiological studies.

Notes

1. Dasgupta and Mäler (2000), Dasgupta (2001a, b), and Section 2.

2. For references to the technical literature on sustainable development, see Pezzey and Toman
(2002).

3. Dasgupta and Mäler (2000), Dasgupta (2001a, b), and Section 2.

4. Serageldin (1995) and Pearce, Hamilton and Atkinson (1996) were early explorations of the
practicalities of estimating a nation’s comprehensive wealth.

5. The material in Section 2 has been taken from Dasgupta and Mäler (2000) and Dasgupta
(2001a, b).

6. In a companion paper (Arrow, Dasgupta and Mäler 2003) we have developed criteria for
identifying sustainable development under changing population size in optimizing economies.

7. Differentiability everywhere is a strong assumption. For practical purposes, however, it would
suffice to assume that V is differentiable in Ki almost everywhere. The latter would appear to
be a reasonable assumption even when production possibilities (including ecological processes)
are realistically non-convex. See Section 4 below. However, if the location of these points on
the space of capital stocks is uncertain and the uncertainty a smooth probability distribution, the
expected value of Vt would be continuous.

8. It is not our purpose to review the several ways in which sustainable development can be, and 
has been, defined. Pezzey (1992) contains an early, but thorough, classification. 

9. For convenience we have defined sustainability only for a moment in time. One could insist on 
the infinitely more demanding requirement: dVt/dt ≥ 0 for all t. Readers can confirm that our
results can be rephrased in the obvious manner to be in accordance with this stiffer condition. 

10. One of us (KJA) has produced an example of an optimum economic programme displaying the 
latter feature. 

11. Pearce and Atkinson (1993) noted this result for optimizing economies. 

12. Theorem 4 is, of course, familiar for economies where the government maximises social welfare
(see, e.g., Arrow and Kurz 1970).

13. If the project has been designed efficiently, we would have:

ΔYt = (∂F/∂K)ΔKt + (∂F/∂L)ΔLt + (∂F/∂R)ΔRt,

where F is an aggregate production function (Y = F(K, L, R)). The analysis that follows in the
text does not require the project to have been designed efficiently. As we are imagining that
aggregate labour supply is fixed, ΔLt used in the project would be the same amount of labour
displaced from elsewhere.

14. Dasgupta, Marglin and Sen (1972) and Little and Mirrlees (1974), respectively, developed their 
accounts of social cost-benefit analysis with consumption and government income as numeraire. 
Which numeraire one chooses is, ultimately, not a matter of principle, but one of practical 
convenience. 

15. Thus

$$\int_t^\infty U'(C_\tau)\,\frac{\partial C_\tau}{\partial R_t}\,e^{-\delta(\tau - t)}\,d\tau.$$

Notice that if manufactured capital were to depreciate at a constant rate, say γ, the social cost of
borrowing capital would be kt = δ + γ - (dpt/dt)/pt.

Let qt be the accounting price of the resource in situ. At a full-optimum, pt∂F/∂Rt = qt = qt, and
U'(Ct) = pt.

16. To prove (16) notice that, by definition, ρt satisfies the equation

$$U'(C_t)\exp(-\delta t) = U'(C_0)\exp\left(-\int_0^t \rho_\tau\,d\tau\right).$$

If we differentiate both sides of the above equation with respect to t, (16) follows.

17. Notice that in imperfect economies δ and η may be unobservable. See Section 2.2.

18. Person-specific factors (e.g., age, health status, gender) can be included in the welfare function.
This is routinely done in applied economics. 

19. See the interchange between Johnson (2001) and Dasgupta (2001c) on this. For a more detailed 
analysis of the connection between environmental and resource economics and the economics 
of poverty, see Dasgupta (1982, 1993, 2000, 2003). 

20. Estimates of the elasticity of marginal utility obtained from consumer behaviour, or, alternatively,
from consumer responses to questions, have typically been in the range 1.5-2.5. The evidence
thus acquired does not of course reflect what we mean by η here, but it is close enough.

21. These are not fanciful figures. Per capita consumption in a number of countries in sub-Saharan
Africa declined over the past three decades at as high a rate as 1 percent per year, implying that
for small values of δ, the consumption rate of interest would have been negative.

22. As the economy has a single asset, Theorem 3 is trivially true.

23. A special case of formula (32) appears in Dasgupta, Marglin and Sen (1972). However, unlike 
our present work, the earlier publication did not provide a rigorous welfare economic theory for 
imperfect economies. 

24. Solow (1974) and Hartwick (1977) are the key articles on this limiting case. 

25. For the ecology of shallow lakes, see Scheffer (1997) and Carpenter, Ludwig and Brock (1999). 

26. Note too that because the resource allocation mechanism is imperfect, -Uc ≠ ∂V/∂S (see
Section 4.2). 

27. Although, for ease of exposition, we are using the same notation, the points S1, S2, and S3 here
are not the same as the points S1, S2, and S3 in the previous sub-section.

28. To the best of our knowledge, Kurz (1968) was the first to note that if utility depends directly 
on capital stocks, the optimality conditions may possess multiple stationary points even in a 
convex world. Skiba (1978) showed that in non-convex economies the optimality conditions 
may possess multiple stationary points even if the utility function is independent of stocks. The 
model of Brock and Starrett (2003) combines the two features. 

29. Brock and Starrett (2003) refer to S as a Skiba point, the reference being to Skiba (1978). 

30. McKelvey (1980) has studied a special case of the model of diffusion developed below. 

31. In the limit, as k tends to infinity, yS tends to infinity, implying that depletion is instantaneous. 

32. See also Hartwick and Hageman (1993) for a fine discussion that links El Serafy’s formula to 
Hicks’ formulation of the concept of national income (Hicks 1942). 

33. That the industry optimizes does not mean that the economy is following an optimum 
programme. 

34. It should be noted though that the value of urban land would be more than just the new invest- 
ment: there is a contribution to the value (which could be of either sign) arising from changes in 
population density - both in the newly developed property and in places of origin of those who 
migrate to the property. 

35. Social cost-benefit analysis, as sketched in Section 2.4, would enable a country to estimate 
whether it ought to alter its emissions. Nordhaus and Yang (1996) have studied international 
carbon emissions as the outcome of a non-cooperative equilibrium game among nations. 

36. Asheim (1996), Sefton and Weale (1996), Vincent, Panayotou and Hartwick (1997), Aronsson 
and Löfgren (1998), and Cairns (2002) have published related findings, but in the context of
optimising economies. 

37. In this case the resource will be exhausted in finite time. For notational simplicity, we continue 
to present matters as though the horizon is infinite. 

38. For a discussion of such problems and possible resolutions to the paradoxes that normative 
population theory has given rise to, see Dasgupta (2001b). 

39. If Nt is a logistic function, n(Nt) = A(N* - Nt), where A and N* are positive constants. 

40. See Dasgupta (2001b) for a justification of this form of intergenerational welfare. 

41. In Dasgupta (2001b) Theorem 6 was invoked to assess whether the world’s poorest regions have 
experienced sustainable development in the recent past. 

42. The reader can confirm that if 0 < η < 1 in (128), then $dp_t/d\bar{\mu} > 0$ and $dp_t/d\sigma^2 < 0$; and if
η = 1 (i.e., U(C) = log C), then $dp_t/d\bar{\mu} = dp_t/d\sigma^2 = 0$. See the following footnote for an intuitive
explanation for these results.

43. The reader can confirm that if 0 < η < 1 in (128) and (134), then $dp_t/d\bar{\mu} > 0$ and $dp_t/d\sigma^2 < 0$.
To understand the result, note that if 0 < η < 1, then U is unbounded above, but bounded below.
η = 1 corresponds to the case where U(C) = log C. In this case s* is independent of both $\bar{\mu}$ and
$\sigma^2$, and so $dp_t/d\bar{\mu} = dp_t/d\sigma^2 = 0$. The opposite pulls arising from the unboundedness of U at both
ends cancel each other. See Hahn (1970) for an intuitive explanation for the way η influences the
relationship between σ and s*.

44. See for example, Murray (1993).

45. On this see Dasgupta (1993).